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\  Abstract 

•V 

Recently  proved  theorems  concerning  weak  convergence  of  non-Markovian 
processes  to  diffusions,  together  with  an  averaging  and  a  stability  method,  are 
applied  to  two  (learning  or  adaptive}  processes  of  current  interest:  (1)  an 
automata  model  for  route  selection  in  telephone  traffic  routing,  (2)  an 
adaptive  quantizer  for  use  in  the  transmission  of  random  signals  in  communica¬ 
tion  theory.  TOe  models  are  chosen  because  they  are  prototypes  of  a  large 
class  to  which  the  methods  can  be  applied.  TOe  technique  of  application  of 
the  basic  theorems  to  such  processes  is  developed.  Suitably  interpolated  and 
normalized  ^ learning  or  adaptive ♦  processes  converge  weakly  to  a  diffusion, 
as  the  ^learning  or  adaptation*  rate  goes  to  zero.  For  small  learning  rate, 
the  qualitative  properties  (e.g.,  asymptotic  (large-time)  variances  and  para¬ 
metric  dependence]  of  the  processes  can  be  determined  from  the  properties  of 
the  limit.  . 
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I .  INTRODUCTION 

References  [7],  tH  develop  a  useful  method  to  study  the  asymptotic  proper¬ 
ties  as  t  +  0  and  nc  <  T  <  “  for  any  real  T  of  solutions  to  stochastic  difference 

equations  of  the  form 

(1-1)  Yn+1  =  Yn  +  +  4  gE(Yn'0  +  oU)  '  Yn  6  ^ 

where  the  distributions  of  the  random  sequence  {£^1  might  depend  on  the  { Y n } . 
Such  equations  occur  frequently  in  applications.  The  methods  in  II]  also  work 
when  e  is  replaced  by  a  sequence  t  ■*  0  as  n  -*  00  from  which  asymptotic  proper¬ 
ties  (rates  of  convergence)  of  various  forms  of  stochastic  approximations  can 
be  obtained. 

The  emphasis  in  [1]  (an  application  of  [7])  concerned  the  case  where  the 

h^  and  gf  are  smooth,  and  no  details  for  the  non-smooth  case  or  its  applications 

were  given,  nor  was  the  asymptotic  case  where  n  then  c  -+  0  treated.  This 

is  a  deficiency,  since  in  many  applications  in  communication,  control  and 

automata  theory,  the  h^  and  g^  might  simply  be  indicator  functions  and  the 

noise  {£  }  depend  on  {yr},  and  the  asymptotic  properties  (as  n  ■*  «»,  then  c  ■*  0) 
n  n 

desired.  Here,  we  apply  the  basic  results  of  [7]  to  two  such  problems.  The 
two  problems  have  current  technological  importance  in  their  own  right  and  each 
has  been  the  subject  of  a  great  deal  of  work.  Our  method  often  yields  a  complete 
analysis  of  the  asymptotic  properties  under  realistic  conditions.  The  two 
problems  are  typical  of  a  wi< e  class,  and  they  illustrate  the  power  and 
applicability  of  the  general  technique,  as  well  as  the  method  of  applying  it 
to  concrete  problems.  In  a  sense  the  method  is  an  extension  with  more  complex 
memory  structure  of  the  sort  of  "slow  learning"  results  obtained  by  Norman  (9), 
and  should  have  broad  applications  to  the  areas  cited  above. 
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The  basic  type  of  result  is  the  following.  Define  Y  (•),  t  £  10,®),  by 
£  £  £  £ 

Y  (0)  =  Yq  and  Y  (t)  *  Y^  on  [ie,ie+e) .  Under  appropriate  conditions, 

Theorem  1  gives  weak  convergence  of  {Yl(*))  in  Dr(0f®)  to  a  particular  diffusion 

process,  as  e  +  0.  Now,  let  { n£. }  denote  a  sequence  of  integers  tending  to  “ 

as  e  0.  For  t  ■>  0,  define  Y  (t)  -  Y  (t+tne)  .  The  tilde  always  denotes  a 

shift  by  n£  (discrete  parameter)  or  en^  (continuous  parameter) .  By  using 

Theorem  1  but  starting  {Y  }  at  time  n  instead  of  at  time  0,  we  will  get  a 

n  e 

great  deal  of  information  on  the  asymptotic  properties  (large  n,  small  e) . 

The  next  section  gives  some  background  material  from  [7] .  Sections  111  to  VI 
treat  a  learning  automata  approach  to  certain  problems  in  adaptive  routing  of 
telephone  calls  l2]-[3].  The  second  problem,  in  Sections  VII-VII1,  concerns 
the  asymptotic  theory  of  an  adaptive  quantizer  from  communications  applications 
[4],  15]. 
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II.  SOME  BACKGROUND  MATERIAL 
r  r 

D  [0,®)  denotes  the  space  of  R  -valued  functions  on  (0,®)  which  are  right- 
continuous  and  have  left-hand  limits,  and  is  endowed  with  the  Skorokhod  topology 
16].  denotes  the  continuous  functions  on  Rr  *  lO,®)  with  compact  support 

*a  C 

and  %  •  the  subset  whose  mixed  partial  derivatives  up  to  order  «  in  t  and  0 
in  the  components  of  x  are  continuous.  Let  b^(*,*),  a^  (*,*),  i,j  <_  r,  be  con- 
tinuous  functions  on  R  x  [0,®) .  Let  the  operator 


=  l  b^x.t) 

i 


3x. 

l 


l 

i#  j 


a^tXjt) 


3x. 9x , 
i  j 


be  the  infinitesimal  operator  of  a  diffusion  process  X(*)  .  Assume  that  the 
solution  to  the  martingale  problem  (on  Dr[0,®))  of  Strook  and  Varadhan  [8] 
corresponding  to  A  has  a  unique  non-explosive  solution  for  each  initial  con¬ 
dition. 

Let  bN<‘)  denote  a  function  with  values  in  [0,1],  equal  to  1  on  SN  = 

{x:  |x|<N),  equal  to  zero  in  Rr-S„.,  and  with  second  derivatives  bounded 

N+l 

£  N 

uniformly  in  x  and  N.  Define  {y  '  ,  n>0)  by 


(2.1)  Y*;"  «  ye,N  +  [eh  (Y^'N,^)  +  /eg  (YE'N,eE)  +  o(e) )b  (YE'N> , 

n+i  n  enn  enn  Nn 


,e,N 


y0  if 


|Y  |  <_  N  and  is  zero  otherwise, 


and  define  YE,N{*)  analogously  to  Ye ( - ) -  For  purely  technical  reasons,  it  is 

£  N  N 

convenient  to  state  the  theorem  in  terms  of  (Y  '  }.  Let  A  be  the  infinitesimal 

n 

u 

operator  of  a  (not  necessarily  unique)  diffusion  process,  denoted  by  X  (•), 
and  suppose  that  its  coefficients  a^ (•,•),  b^f*,*)  *re  continuous,  bounded. 
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c  N 

have  compact  support  and  equal  a(‘,0,  b(*,*)  in  S^.  Suppose  that  {Y  '  (')} 
converges  weakly  to  some  such  X^(*)  as  t  ■*  0,  for  each  N.  Then  [7]  (Y  (•) ) 
converges  weakly  to  X(«)  as  n  +  «,  The  following  theorem  is  a  restatement  of 
Theorem  3  of  [7]  with  t  =  e  .  Theorem  2  of  [7]  provides  a  very  convenient 
method  of  proving  tightness,  and  we  will  use  it  in  the  sequel.  Let  E^'N  denote 
expectation  conditioned  on  (YE'N,j<h,  j<n). 

Theorem  1 .  Assume  the  conditions  stated  above  on  the  solution  to  the  martingale 

r  N  N 

problem  on  D  [0,<**)  corresponding  to  operator  A,  and  on  A  and  X  (•)  .  For  each 

N,  and  f ( • , • ) €  •&,  a  dense  set  (sup  norm)  in  ,  let  there  be  a  sequence 

{f  '  (•)}  satisfying  the  following  conditions;  it  is  constant  on  each  interval 

[ne,ne+e) ,  at  ne  it  is  measurable  with  respect  to  the  o-algebra  induced  by 

{V-N,  j<n,£E,  j<n)  and 

(2.2)  sup  E|fe,N(ne) |  +  sup  ^  E|EE'Nfe'N(ne+e)-fe'N(ne) |  <  °°  , 
n,e  n,e 

and  as  e  -*■  0  and  for  each  t  as  ne  -►  t. 


(2.3)  E|fe'N(ne)  -  f(YE,w,ne)  j  -  0, 

n 


(2.4) 


E  f  (ne+e)-f  (ne) 


(|r  +  AN)f(YE,N,ne) |  *  0. 
o  t  n 


Then,  if  (YE,N(*) ,  eQ>e>0}  is  tight  in  Dr(0,«)  for  each  N,  where  eQ  does  not 
depend  on  N  and  Y  (0)  converges  weakly  to  X(0) ,  {y  ( •) }  converges  weakly  to 
X{  •) ,  the  unique  aolutioft  to  the  martingale  problem  with  initial  condition  X(0) . 
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III.  AN  AUTOMATA  PROBLEM  -  INTRODUCTION 

Narendra  [2],  [3]  and  others  have  studied  the  application  of  automata  and 
learning  theory  to  problems  in  the  routing  of  telephone  calls  through  a  multi- 
node  network  and  have  suggested  a  variety  of  interesting  automata  models  for 
this  application.  Under  various  assumptions  (both  explicit  and  implicit)  they 
have  stated  convergence  results  in  a  number  of  cases.  Generally,  their  results 
are  applications  of  Norman's  (9]  results  on  slow  learning.  Here,  we  take  one 
of  their  models  and  show  how  to  apply  Theorem  1  to  get  a  much  more  complete 
asymptotic  theory  (large  time)  for  small  rate  of  change  of  the  automata  behavior 
(e) ,  and  under  more  realistic  conditions.  The  case  dealt  with  here  can  readily 
be  generalized  -  as  will  be  commented  on  below.  The  example  illustrates  the 
power  and  usefulness  of  the  approximation  techniques  used  here.  The  algorithm 
should  be  considered  as  a  prototype.  It  might  not  be  the  best,  but  it  well 
serves  to  illustrate  the  method. 

The  problem  formulation.  Calls  arrive  at  a  transmitting  or  switching 
terminal  at  random  at  discrete  time  instants  n  -  0,1,2,...,  with  P{one  call 
arrives  at  nth  instant)  *  u,  u  £  (0,1),  P{>1  call  arrives  at  nth  instant)  *  0. 
From  the  terminal,  there  are  two  possible  routings  to  the  destination,  route 
1  and  route  2,  the  ith  route  having  N^  independent  lines  -  and  can  thus  handle 
up  to  N  calls  simultaneously.  Let  [n,n+l)  denote  the  nth  interval  of  time. 

The  duration  of  each  call  is  a  random  variable  with  a  geometric  distribution: 
P(call  completed  in  the  (n+l)st  interval | uncompleted  at  end  of  nth  interval, 
route  i  used)  ■  A^,  A^  €  (0,1) .  The  members  of  the  double  sequence  of  the 
interarrival  times  and  call  durations  are  mutually  independent.  It  is  possible 
to  work  with  more  general  Markovian  arrival  processes,  but  we  retain  a  simple 
structure  in  order  to  emphasize  the  main  points.  In  practice,  a  more  complex 
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network  would  occur  -  and  perhaps  cycles  might  exist,  and  a  vector  routing 

parameter  would  be  used,  one  component  per  node.  But  the  main  idea  is  similar. 

As  in  Theorem  4,  the  average  dynamics  are  used  for  the  stability  analysis. 

From  that  point  on,  the  proof  of  the  appropriate  generalization  of  Theorem  5 

would  be  quite  similar  to  the  proof  of  Theorem  5. 

The  parameter  e  will  be  used  for  the  rate  of  adjustment  of  the  routing 

automaton  -  the  device  which  selects  the  route.  The  adjustment  mechanism  will 

be  defined  later.  The  routing  automaton  operates  as  follows.  For  each  fixed 

e,  let  {y  }  denote  a  sequence  of  random  variables  -  with  values  in  [0,1].  In 
n 

order  to  have  an  unambiguous  sequencing  of  events,  suppose  that  the  calls  ter¬ 
minating  in  the  nth  interval  actually  terminate  at  time  n+-|,  and  arrivals  and 
route  assignments  are  at  the  instants  0,1,2,...  precisely.  Thus  the  state  at 
time  (n+1)  does  not  include  the  calls  just  terminated  or  calls  arriving  at  (n+1) . 

Define  the  "route  occupancy  process"  =  (X^'*,X^'^),  where  X^'*  is  the  number 

+  c ,  i 

of  lines  of  route  i  occupied  at  time  n  .  Thus,  X^  -  Ni'  a  call  arrives  at 

instant  n*l,  the  automaton  "flips  a  coin",  and  chooses  route  1  with  probability 

£  £ 
y^  and  chooses  route  2  with  probability  (1-y^) •  all  lines  of  the  chosen  route 

i  are  occupied  at  instant  (n+1)  ,  then  the  call  is  switched  to  route  j  ( j  +  i) . 

If  all  lines  of  route  j  are  also  occupied  at  instant  (n+1)  ,  then  the  call  is 

rejected,  and  disappears  from  the  system. 

In  a  more  realistic  situation,  the  network  would  have  many  nodes  -  not 

simply  2,  and  many  possibilities  of  routing  from  node  to  node.  The  adjustment 

algorithm  might  be  different,  but  the  problem  would  be  handled  in  exactly  the 

£ 

same  way.  The  object  is  to  adjust  the  {y  )  sequentially  (based  on  the  system 

n 

behavior)  so  that  some  desired  behavior  occurs.  In  order  to  be  specific,  we 

use  the  following  "linear-t» ward"  algorithm  [3).  Let  denote  the  indicator 

in 


T 

4 
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of  the  event  {call  arrives  at  n+1,  is  assigned  first  to  route  i  and  is 
accepted  by  route  i).  For  practical  as  well  as  theoretical  puri»ses,  it  is 

important  to  bound  yl  away  from  the  (joints  0  and  1.  Let  0  <  yf  <  y  <1.  We 

|Y.. 


use  the  algorithm  (3.1),  where 
1-y,  6(y)  *  -y. 


denotes  truncation  at  y  or  y„,  and  a(y) 
y?  u 


(3-1)  yn+1  =  Iyn  +  ea<Yn)Jln  +  e^yn>J2n] 


Define  a  (•),  6  (•)  such  thata(-)  =  a  (•)  in  [y0,y  -c]  and  £(•)  = 
e  e  f  *■  u 

in  ly^+Ejy^]  and  otherwise  arc  such  that  (3.2)  is  equivalent  to  (3.1) 


(3*2)  W*n+£lV^JL  + 


We  will  study  the  asymptotics  of  the  behavior  of  a  centered  and  normalized 

{y^}  for  small  e.  Part  of  the  difficulty,  which  our  scheme  is  well  able  to 

handle,  is  due  to  the  fact  that  {y£}  is  not  Markovian.  In  the  theoretical  parts 

£ 

of  [2],  [3],  the  problem  is  set  up  so  that  {y^}  is  Markovian. 


e 

Some  definitions.  If  the  choice  probabilities  y^  are  held  fixed  at  some 

value  y  for  all  n,  then  the  route  choice  automaton  still  makes  sense,  although 

there  is  no  learning.  For  fixed  route  selection  probability  y  £  (0,1),  let 
1  2 

(Xn<y)}  *  (OC^ty) ,X^(y»,  0£n<»)  denote  the  corresponding  route  occupancy 
process.  For  the  process  (x^Jy)),  the  state  space  Z  *  { ( i , j ) s  i^N^,j<N2)  (whose 


points  are  supposed  ordered  in  some  fixed  way)  is  a  single  ergodic  class,  and  the 
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probability  transition  matrix,  denoted  by  A* (y) ,  has  infinitely  differentiable 

components.  With  qiven  initial  condition  (PlX^(y)=u),  u07,l  define  P^(ot  |y>  - 

p{X  (y)=»a)  and  the  vector  P  (y)  =  { (a  | y )  r  z)  .  Then 

n  n  n 


(3.3)  Pn+l<y)  *A(V>pn(y>- 


The  pair  t  (XL  ,yb)  ,n>0)  is  a  Markov  process  on  7,  y  ly„,Y  1  and  the  marg  inal 
n  n  —  v  u 

transition  probability  P{X^+1=(k ,  J.)  !  X^=(i ,  j)  ,  yM  is  just  the  ( (i,  j) -column, 
(k,i)-row)  entry  of  A*(y^)  .  Define  the  vector  P^  =  {pNa)  ,  «Gz)  where  pn(a)  = 
P{X^=a|y^,  £<n,  X^}.  Then 

(3-4)  Pn+1  ~  A(yn>Pn* 


Also,  let  P(y)  *  (P(aly)  ,  «€:  Zl  denote  tho  unique  invariant  measure  for  (x^Cy)}, 
with  marginal  defined  by  P^jly)  *  ^P(  j,k|y)  ,  P  (y)  •  (P  (j|y)»  '  and 

similarly  for  route  2.  Finally,  define  the  transition  probability  P(a,j,a.Jy)  ■ 
P{X^.  (y)=a1|xQ(y)=a}  and  write  the  marginal  as 


P  i(c»,  j  »k  |  y)  =  p{X^(y)=k|x0(y)=a}. 


Define  E^  to  be  the  expectation  conditioned  on  (X^,  y^,  i^n) . 

Ni 

A  relationship  of  (3.1)  to  a  differential  equation.  Define  v  *  (1-a^) 
Note  that 


£  € 

E  J,  ■ 
n  In 


(3.5a) 


Uye(l  -  V.I{x';'1»N  n, 
n  in  i 
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(3.5b)  e£j£  -  W(l-y*)  11  -  v_i{x^'2«H-)1. 

n  n  i  x\  z 

For  small  e,  it  is  reasonable  to  try  to  relate  the  behavior  of  (y£)  to  the 

solution  of  (3.6),  where  F(y)  is  just  Ela(y)o!,  +  H(y)J£  ),  but  witli  {x£,y£} 

In  in  n  n 

replaced  by  {X  (y),y)  and  using  the  stationary  measure. 

‘  n 

(3.6)  y  ■  ya(y)y [1-v^P1 (N^ | y) ]  -  p (1-y) 8 (y)  ll-v  P2 (N2 1 y) ] 

o  t  A 

=  py(l-y)  [v2P  (N2 1 y)  -  v^P  (N^ | y )  1  £  F(y). 

X  2 

As  y  increases,  P  (N^jy)  increases  (and  P  (N2|y)  decreases)  monotonically . 

Thus,  there  is  a  unique  point  y  £  (0,1)  such  that  F(y)  *  0.  Also,  F(y)  >0 

for  y  <  y  and  F(y)  <0  for  y  >  y.  We  assume  that  y  £  (y^,yu>  and  we  also  make 

the  apparently  unrestrictive  assumption  that  F^(y)  /  0.  We  actually  will  study 

the  asymptotic  properties  of  UE  2  (y£-y) //e,  for  large  n  and  small  f. .  In 

particular,  let  n£  be  a  sequence  of  integers  tending  to  °°  as  e  -*•  0,  and  define 

the  processes  UE(*)  by  U£(0)  =  U£  and  U£(t)  =  U£  ..  on  (ie,ie+e).  When  the 
c  n  n  +i 

e  c 

Ue(-)  are  dealt  with,  the  {n£}  will  either  be  explicitly  defined  or  their 
values  will  be  unimportant.  We  show  weak  convergence  of  {0£(*)1  to  the  Gauss- 
Markov  diffusion  u(-)  defined  by  (6.3).  If  n£  00  fast  enough  as  £  +  0,  then 
the  limit  u(*)  is  stationary.  The  general  method  can  be  applied  to  many  other 
problems  in  learning,  automata  and  systems  theory. 
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IV.  SOME  PRELIMINARY  RESULTS 

In  this  section,  we  prove  some  auxiliary  results  concerning  uniform  con¬ 
vergence  of  Pn(y)  and  its  derivatives  to  P(y)  and  its  derivatives. 

Theorem  2.  For  each  y  g  [y„,y__),  let  A'(y)  denote  a  Markov  transition  matrix 
(continuous  in  y)  such  that  the  corresponding  Markov  chain  (X^(y) }  is  ergodic 
with  invariant  measure  P (y) .  Then  P ( • )  is  also  continuous  and  there  is  a  6  >  0 
such  that  the  eigenvalues  of  A(y) ,  except  for  the  single  eigenvalue  unity, 
are  bounded  in  absolute  value  by  1-6  for  all  yg  ty^y^*  pn(y)  converges  to 
P(y)  uniformly  (and  at  a  geometric  rate)  in  y  g  an{*  in  PQ(y)  • 


Proof.  The  last  sentence  follows  from  the  penultimate  sentence.  The 
continuity  of  P(-)  is  a  consequence  of  the  uniqueness  for  each  y,  of  the  eigen¬ 
vector  of  A(y)  corresponding  to  the  eigenvalue  unity  (the  invariant  measure) . 
Next,  suppose  that  there  is  no  such  6.  Let  A(y)  be  a  q*q  matrix  and  let 

X^y),  _ _  X^(y)  denote  the  eigenvalues.  Order  them  such  that  A ^ (y)  1. 

Then  there  is  a  y  and  a  sequence  (y^JC  [y?  ,yuJ  such  that  as  yn  -*■  y,  at  least 
one  eigenvalue  (other  than  the  one  which  is  always  unity)  approaches  the  unit 
circle.  In  particular,  suppose  that  the  ordering  is  such  that  IX^y^l  1 
and  that  (choosing  a  subsequence  if  necessary)  the  X^(y^)  converge  to  some  X^ 
as  n+«,  for  i  =  l,...,q.  The  {X^}  must  be  the  eigenvalues  of  A(y)  .  But 
then  A’(y)  is  not  the  transition  matrix  of  an  ergodic  process,  a  contradiction. 


Q«E«Di 


Definition.  Let  E(y)  denote  the  span  of  the  eigenvectors  and  generalized 


eigenvectors  of  A(y) ,  except  for  the  eigenvector  which  corresponds  to  the 
eigenvalue  unity. 


Theorem  3 .  Assume  the  situation  of  Theorem  1 ,  but  let  A ( • )  be  continuously 
differentiable  on  ly£,y  1  (at  the  endpoints,  take  the  left-  or  right-hand  deri¬ 
vatives,  as  appropriate) ;  then  so  is  P(-),  and  Py(y)  is  the  unique  solution  in 
£ (y)  to  the  equation 


(4.1)  Py(y)  =  A(y)Py(y)  +Ay(y)P(y). 


Furthermore,  the  derivative  P  (y) 
-  n,y  * 


given  by 


(4‘2)  Pn+l,y  y  y  n,y  y  y  y  n  y 


converges  geometrically  to  P  (y) ,  uniformly  in  y  £  [y. ,y  ]  and  in  the  initial 

y  X\1 

condition  Pg(y)  /•  if  we  set  PQ  y(y)  *  0. 

If  A(  • )  has  continuous  second  derivatives  on  [y ^ , y^ ] ,  then  so  do  P(>)  and 
Pn  ( • )  ,  and  (y)  is  the  unique  solution  in  E  (y)  to 

<4‘3)  Pyy(y)  =  A(y)pyy(y)  +  2Ay(y)py(y)  +Ayy(y)p<y)> 

Also,  P  (y)  converges  geometrically  to  P  (y)  ,  uniformly  in  y€  [y  ,y  J 
n , yy  yy  ™  ^ 

and  in  the  initial  conditions,  if  P„  (y)  ■  P_  (y)  ■  0. 
-  0,y  0 ,yy 


Proof.  Fix  y.  Since  (I-A(y))V  =  0  for  V  (E  My)  implies  that  V  =  0,  in 
order  for  (4.1)  to  have  a  unique  solution  in  My)  it  is  necessary  and  suffic¬ 
ient  that  Ay(y)P(y)  -1  >f(I-A'(y)),  where  Af  denotes  the  null  space  of  the 
matrix.  I-A'(y))  is  the  set  of  vectors  Q  such  that  A'(y)C>  *  Q.  Since  there 
is  a  unique  eigenvalue  of  value  unity  and  since  the  row  sums  of  A'(y)  are  all 
unity,  the  components  of  Q  must  all  have  the  same  value.  Thus,  the  necessary 
and  sufficient  condition  reduces  to  A^(y)P(y)  Jl  constant  vectors.  For  any 
constant  vector  C  =  (c,c,...)',  C'A(y)  =  C*.  Thus,  C'A^(y)  =  0  and  hence 
A^lylDl  constant  vectors  for  any  vector  D.  Consequently  (4.1)  has  a  unique 
solution  P  (y)  in  My). 

y 

Next,  we  show  that  P  (y)  is  the  desired  derivative.  Write  (for 

y 

y  €  (y£»yu) »  otherwise  6  >  0  or  6  <  0,  as  appropriate) 

A(y+6)P(y+6)  -  A(y)P(y)  =  P(y+6)  -  P(y) . 


Thus, 

(4.4)  [A(y+6) -A(y)J.  p(y+6)  .  (I_A  (y) } . 

The  left-hand  side  of  (4.4)  is  uniformly  bounded  and  is  in  My)  for  each  6  >  0 
(since  (I-A(y))v£  T.  (y)  for  any  V)  and  it  converges  to  Ay(y)P(y)  as  6  -+•  0.  When 
considered  as  an  operator  from  M y)  to  My),  [I-A(y)]  has  a  bounded  inverse. 
Thus,  as  {  +  0,[P(y+6)-P(y) )/6  converges  to  P^(y),  which  must  equal  Py(y) ,  by 
the  uniqueness  proved  above. 

We  now  turn  to  the  convergence  (4.2).  By  Theorem  1,  Pn<y)  converges  geo¬ 


metrically  to  r(y) ,  uniformly  in  t Y p » Yu 1  and  in  P0(y)  •  Also,  since  we  use 
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Pn+l,y(y)  I  An-1<y)A  (y)P.(y) . 
j*0  1 


But  Ay(y)r^(y)  is  a  bounded  sequence  in  >- (y)  ,  and  as  i  it  converges  geo¬ 
metrically  and  uniformly  to  A^(y)P(y).  Also  A(y)  is  a  contraction  when  acting 
in  ?.( y)  ,  uniformly  in  y  £  T*'ese  facts  imply  the  desired  convergence 

of  P  (y) .  The  limit  must  be  a  solution  to  (4.1)  . 
n,y 

The  assertions  concerning  P  are  proved  in  the  same  way  and  wo  omit  the 


details. 


Q.E.D. 
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V.  TIGHTNESS  OF  (UE,  SMALL  t,  LARGE  n) 

_  n  _ _ 

By  "e  small"  and  "n  large"  we  mean  that  there  are  eQ  >  0,  <  n' ,  such 

that  the  assertion  holds  for  c  <  cQ,  n  >_  .  The  actual  value  of  will  be 

unimportant.  Basic  to  the  proof  of  weak  convergence  of  (ul(-)}  is  the  tightness 

of  (Ue,  small  e,  large  n). 
n 


Theorem  4 .  For  each  small  e  >  0,  there  is  an  N^  <  °°  such  that  the  doubly 
indexed  sequence  {UE,  e  small,  n  >  N£)  is  tight,  where  UE  =  (yE-y)//e. 

_  2 

Proof.  Define  v(y)  =  (y-y)  .  We  have 


(5.1a) 


(5.1b) 


e  ,  e 
En  yn+l 


£.2 

.yn)  = 


!yl»2(y')y'u-Vil{X‘'1-»i))  .  s|(y'l  (1-y'l  U-v2Kx''2.»2})  1 


For  small  e, 


En(yn-y)  t“e(yn>Jl,n+0e(yn)J2,n3  -  En(yn‘y)  ‘“‘V  Jl,n+6  (yn3  J2,n3 ' 


since  0  a£{y)  1.  a (y)  and  a£(y)  /  a(y)  only  if  yE-y  1  0  (for  small  e)  ,  and  con¬ 
versely  for  the  B  term.  Using  the  above  inequality,  (5.1a)  and  | y„+l— yn I  *  ' 

(5.2)  EEV(yE+1) -V (y„)  <  2ye(yE-y)  laly^y^l-VjliX^'1-^}) 

+  (i(yt)  (l-y1-)  (1-v  l(xE'2»N  })  )  ♦  0(c2). 
n  n  2  n  2 
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Define  V^tn)  by 


(5.3)  VE(n)  -  2|it  (y^-yluty^y^Vj  }  lP1(N1|y^)  -  P1  (XE  ,  j-n,^  |  yE)  J 


1»n 


+  2ue(y^-y)0(y^U-y|i)v2  £  [P  (N  | y^J  -  r2(X^, j-n,N2|yE)  1 

1-n 


Note  that  P  (Xn,0,N^|yM  *  l{Xn'  *Nj^  •  By  Theorem  2,  the  sums  converge  absolutely 

(the  summands  go  to  zero  at  a  qeometric  rate)  uniformly  in  n,  yL,  X*  .  Thus 

n  n 

| ( - ) |  =  0(e),  uniformly  in  all  the  variables. 

Next,  evaluate 


EEVE(n+l)-VE(n)  =  -2pe  (yE-y)  a  (yE)  yV  [P*  (N.  |  yE)  -  l{xE,1=N,}] 
n  i  i  n  n  n  i  in  ni 


e  n>^ rvi  l..G\  _ 


2ye (yE-y) B (yE)  (l-yE) v  [P2 (N  | yE)  -  l{xE'2-Nj] 
n  n  n  6  /.  n  n  ^ 


(5.4)  ♦  I  2^4,  (E‘(v'<1-5)-,<y^1),‘,1IP1(»1|,^1)  - 

j=n+l 


.l,„e 


(yE-y)a(yE)y  [P1(N  |yE)  -  P1 (XE , j-n,N  | yE) ] } 
n  nn  in  n  in 


-r  a  similar  sum  for  route  2. 


2 

He  next  show  that  the  sums  in  (5.4)  ■  0 (c  )  uniformly  in  all  the  variables 

n,  yE,  XE.  For  simplicity  we  work  only  with  the  first  sum  (route  1) .  By 
n  n 

|yE+1~yE|  =  0(e),  the  smoothness  of  a(0  and  6(0  and  Theorem  2,  the  sum  changes 

by  0(e2)  if  (yE+1~y) a (yE+^) y^+^  i»  replaced  by  (yE-y)a(y^)y^.  Upon  making  the 

substitution  and  using  the  Markov  property  of  (X,(y) ,  j>n)  with  the  value  y  • 

yE  and  "initial"  condition 
7n 


X  (y  )  =  X", 
n  J  n  n 


|y»> 


*=  r1  ( X^ ,  j-n,N1|yS  . 


we  can  rewrite  the  sum  as 


(5.5)  0(e)  l  E‘ { [P1  (N  |yE+1>  -  P^NjyS] 

i=n+l 


-  tpl(xn+i' j_n*1'Nilyn+i)  “  ,’1(Xn+l'j"n‘l'Nllyn>  11  4  0('  2) 


C  £  €  i 

Write  5y  =  y  -y  ,  and  use  the  differentiability  (Theorem  3)  of  the  P  and 
n  n+ 1  n 

the  law  of  the  mean  to  write  (5.5)  in  the  form 


i 

'<‘>4yS  .  I  ,En  |,pJ(Vyn**<yn’  - 

]=n+l  J 


Py(Xn+l' j~n-1'Nilyn4s6yn) lds  4  0(c2) 


By  Theorem  3,  the  sequence  of  absolute  values  of  the  inteqrands  converqes  to 

zero  geometrically  as  j  +  «,  uniformly  in  s,  n,  <5yC ,  and  XL  This,  together 

n  n+1 

with  |6yE  |  =  0(e),  yield  that  (5.5)  is  0(e2) .  The  same  result  holds  for  the 

sum  in  (5.4)  corresponding  to  route  2. 

Define  VE(n)  *  V(yE)+VE(n).  By  (5.2)  and  (5.4)  and  the  fact  that  the 
n  i 

2 

sums  in  (5.4)  are  0(c  ) , 


Ecve (n+1) -VE (n)  <  0(c2)  ♦  2yr'yE-y) (a(yE)ye (1-v  P1 (N  jyE)) 
n  “  n  nnxi  n 


+  S <y*> (l-y^)  a-v2P2 (N2  jyE ) ) ] 


Owing  to  the  definition  of  a(-)  and  (*(•)  and  the  fact  that  y  G  [y.»y  )*  tha 


bracketed  term  has  its  unique  zero  at  *  y  and  it  is  positive  (negative, 
resp.)  for  y£  <  y  (y£  >  y,  resp.).  Thus,  there  is  a  y  >  0  such  that 

(5.6)  Eeve(n+1)-Ve(n)  <0(c2)  -  eyV(y£) . 

n  —  n 

By  |v£(n)|  *  O(r)  uniformly  in  n,  E£V£  (n+1 ) -VC  (n)  <  0(i2)  -  iyvL(n),  and  hence 

(5.7)  EVC(n)  <_  (exp  -  eyn)EV£(0)  +  0(e). 

Again,  since  |v£(n) |  =  0(e) ,  uniformly  in  n,  (5.7)  holds  for  V(y£)  replacing 
V£(n),  from  which  the  existence  of  the  )  and  the  asserted  tightness  follows. 
In  particular,  let  0  <  KQ  be  arbitrary  and  let  be  the  smallest  integer  n 
such  that  (exp  -  eny)  K^c.  Q.E.D. 


i 
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VI.  WEAK  CONVERGENCE  OF  {UL(*)) 


Definition.  Recall  the  definition  of  N  given  at  the  end  of  the  proof  of 

Theorem  4.  For  any  sequence  of  integers  n  >N  ,  define  0  *  n  -N  .  Define 

c  t  c  i  t 

yE  =  y£  .  and  similarly  define  the  "shifted"  sequences  U*  ,  X*  and  J1  .  Then 
n  n  +n  n  n  in 


(6.1) 


O’  =  V€  +  Gla  (y  ) JE  +  6  (y  )JE  ] 
n+1  n  e  n  In  f.  n  2n 


By  Theorem  4, 

•>£  »N 
J .  via 
in 


(uE, 

n 


e  small)  is  tight. 


For  each  integer  N,  define  UC,N,  yE,N, 

n  n 


(6.2) 


=3E'N  + 

n+1  n 


^[«(V-)^;N  +  B(y 


^'N)jE;Nib  (uE'N) , 


where  b^t-)  is  defined  above  (2.1)  and  we  set  UQ'  *  Ug  if  |  XJQ  |  <_  N  and  equal 

to  zero  otherwise.  Also  UE'N  *  (ye,N-y)/^e  defines  yE,N.  J?'N  is  simply  the 

indicator  function  of  the  set  (route  i  is  tried  first  and  call  accepted)  for 

the  system  (XE'N,yE'N),  wb9re  the  choice  probabilities  (yE'N)  arc  used  to  select  the 

routes  and  (x^'  )  is  the  corresponding  route  occupancy  process.  We  suppose 

that  XE'N  -  XE  .  Let  Ee,N  denote  expectation  conditional  on  yE,N  and  XE'N. 

0  n£  n  n  n 

Since  |yE'N-y|  <  /e(N+l) ,  for  small  t  it  is  irrelevant  whether  we  use  a  ,  6  or 
a,  8  in  (6.2),  and  we  use  a,  8  for  simplicity.  By  Theorem  1,  if  we  show  that 

jg 

(for  each  N)  (U  ‘  (•))  is  tight  and  that  all  weak  limits  satisfy  (6.3)  until  first 
escape  from  SN,  then  { £fe ( * )  )  is  tight  and  all  weak  limits  satisfy  (6.3). 

We  now  define  some  auxiliary  processes  which  are  used  in  the  averaging 
method  ennloved  in  the  proof.  Let  P  denote  the  measure  defined  bv  the  stationary 
nrocpss  (X^(v),  .«*>-]>-'«■},  with  correspond  inn  expectation  operator  E.  For  each  n, 
it  is  necessary  to  introduce  the  process  (X^(y),  j>n},  but  with  "initial"  condition 
Xnly)  ■  3(E'N.  (I.e.f  after  time  n,  the  route  choice  probability  is  y.)  The  opera- 
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tor  E^'  denotes  the  expectation  of  functions  of  this  process  (x^(y),  j>n)  condi- 

tional  on  the  ''initial"  condition  X  (y)  =  X  *  .  Let  J.  .  (y)  denote  the  indicator 

n  n  1  j 

function  I{call  arrives  at  j+1,  is  assigned  to  and  accepted  by  route  i } ,  when  the 


route  choice  variable  is  y  and  the  route  occui>ancy  process  is  (X^(y)}.  Whether  we 
intend  the  ergodic  process  or  the  process  {x^(y),  i>n}  starting  at  time  n  with 


X  (y)  =  will  be  made  obvious 


by  use  of  either  E  or  Et,N.  Define 

n 


6u_.(y)  =  (otiyJJ^iy)  +  8  (y)  (y)  ] . 


Under  P,  the  right  side  has  zero  expectation. 


Theorem  5.  For  any  sequence  n^  >  N  ,  {U  (•))  is  tight  in  D[0,°°)  .  All  weakly 
convergent  subsequences  converge  to  a  Gausa-Markov  diffusion  satisfying  (6.3). 

If  -*■  <**  as  e  -+  00 ,  then  the  limiting  diffusion  u ( • )  is  stationary  in  that 
u(0)  has  the  stationary  distribution.  (In  all  cases  u(0)  is  independent  of  B(»).) 


(6.3)  du  *  Gudt  +  odB,  B(>)  =  standard  Brownian  motion, 


(6.4)  G  »  Fy(y)  =  uy(l-y)  lv2P2(N2|y)-v1P1 (N1 |y) ) 


y=y 


(6.5)  o2  =  E(6uQ(y))2  +  2  £  E  6uQ(y)6u  (y) . 

n»l 


Proof.  Part  1 .  Until  Part  4,  all  superscripts  N  will  be  omitted. 

,1c  =€  , i  .  -  ,le,N  _t,N  £t,N  ^e,N  ^c,N,i  , 

we  write  (E  , E  ,X_,y_,X  for  tfi_  ,E_  »X_  , y  ,X_  ,...). 

n  n  n  n  n  — —  n  n  n  n  n 

actually  work  with  the  N- truncated  process  in  Parte  1  to  3. 


Thus 

We 
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By  (5.1) , 


(6.6) 


E^(U^  -uS  =  ^(l-L)  W,  lfXL,2=N  )-v  l(Xl  ,1«N  )]b  (UH) 
nn+ln  n  n2n  2  I  n  INn 


^2  3 

Let  f ( •  ,  • )  the  space  of  bounded  (x,t)  functions  with  compact 

support  whose  mixed  partial  derivatives  up  to  order  2  in  t  and  3  in  x  are 

%|  j' 

continuous.  To  apply  Theorem  1  to  1U  (•)  (,  wo  will  qet  an  f  (•)  of  the  form 


f£(ne)  =  fiuSne)  +  f^(ne)  +  f^(ne)  +  f^(ne) 
n  0  1  2 


where  the  f^fne)  will  be  defined  in  the  sequel.  For  each  N,  all  o(-)  or  0(*) 
are  uniform  in  all  variables  except  their  argument.  We  have 


*\»£  *\/F  %r  '\zf  r\jf~ 

E  f(U  .ne+O-fOJ  ,nc)  *  Z  [f  (u*  ,nc) -f  (uE  ,nc)  )  +  f  <U  ,ne)e  +  o(e)  , 
irrjL  n  n  n+i.  n  t  n 


K^K+v'*' *  5%u»' ">> < vrsv  ♦  oic, 


<6.7,  . 

+  fuu(U^,nE)ENu^+i-ij^)2  +  0(e). 
2 


By  the  differentiability  result  of  Theorem  3,  we  can  rewrite  the  term 
before  the  o(e)  as  follows:* 


ebj?(Ue)  f 
N  n  uu 


'VE .  'VC 


2n 


) 


2 


+  o(e)  . 


•The  terms  EeJ,  (y)  and  (y*")  differ  only  in  that  in  the  first  case  y  Is  used 

n  m  n  in  n  \  ■  \ 

as  the  choice  variable  to  get  the  successor  state  to  Xc,  and  yc  is  used  in  the 

n  n 

second  case.  „ 
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Part_2.  We  will  “average  out"  the  terms  in  (6.7)  one  by  one.  Define 
f^(ne)  (analogous  to  the  definition  of  V^n)  in  the  last  section) 


(6.8)  f  j  (nr)  =  4 ^(U^y^l-y,,)  ^(U^m  )  l  l  v.,  (P2  (' j-n.N  J  y^) 


l’"(N2|y^))  -  ^  <PJ  (XE,  j-n,N1|y^)  -  P1  (NJ  |yE) )  ]  . 


(VL  .  1-n  M  I  xY‘  »  _  I  .1.'' 


Proceeding  analogously  to  the  method  of  Theorem  4  for  VE(n),  we  evaluate 


(writing  P1(xSj-n,N  |yE)  in  the  more  convenient  form  ECP1(XC  ,i-n-l,N  |vL) 
n  1  n  n  n+1  J  i|yn 


2;c„i  ,ymc 


in  below,  for  j  >  n;  see  above  (5.5)) 


(6.9)  £efE  (ne+e) -fE  (ne)  =.  T,  +  T  +  T,. 

n  1  1  12  3 


vl  '  ^  XI  l  T  1 


t2  -  ^Ku(8Cfl.»«c)bH(J)^1)^1(l.5^1)|  • 


j=n+l 


’  V1 (pl  (*n+l '  3-n‘1  'N1  I K+l]  ~pl  (N1 1 ^  >  ] ' 


T3  "  -^IfU(^ync,^Cl-y-,bN(r,l 


l  SnIv2(p2(5fn+i'j-n-1'N^y")-p2(N^y')) 


j«n+l 


2  1 7n  *  2 1 7n 


-  Nil?;, 


Using  the  dif ferentiability  result  of  Theorem  3  and  the  fact  that 


2  1 

V2P  (N2|y)  =  vxP  (Njy)  ,  we  get  that  equals  the  negative  of  the  first  term 


on  the  right  side  of  (6.7)  plus 


(6.10)  eyb.I(UE)-r-ly(l-y)  lv,P2(N  |y)-v  P*(N  |y)  JlV")  I  +o(t). 
N  n  3y  '  1  2  2'  1  1 1  n 


In  T2.  by  replacing  yE+1  by  y^  and  bN<UE+1) fu<UE+J ,ne+t)  by 


/X/f  'Vf  'X/r  'l/f  'X>(' 

b„(U >f  (U ,ne)  +  (b  (U  ) f  (U  ,nc))  (u\  ,-U  )  , 
Nnun  Nnun  un+ln 


we  only  alter  the  term  by  o(e) .  Let  us  make  these  replacements  in  T2  and  denote 

0  0  2  2 
the  resulting  term  by  T2_  Now,  split  T2  into  two  parts  (T^,  T^)  ,  the  f^rst 

(second,  resp.)  being  T  but  with  b  (UE)f  (UE,ne)  ( (b  (UE) f  (UE,ne))  (UE  -UE) , 

2  Nnun  Nnun  u  n+1  n 

've 

resp.)  replacing  bN  ^Un+l^  fu*Un+l '  nC+  ^  *  By  the  fl erentiability  results  of 

Theorem  3  and  the  fact  that  lyn+^~ynl  =  °(e)  and  an  argument  like  that  below 
(5.5)  ,  it  can  be  shown  that  +  T3  =  o(e)  .  Thus 


—  'X/C-  'X/C  'XjC 

eyy  (1-y  ) (b  <U  ) f  (U  ,ne)) 
j  n  nNnun  u 


j=n+l 


Vl(pX(^n+l' j_n"1,Nl^n)  "  pl(Nl^n))]‘ 


We  now  simplify  (6.11a)  by  a  series  of  replacements,  each  one  altering  the 
term  by  o(c).  First  replace  all  the  yE  by  y.  By  Theorem  3  and  |uE+1-UE|  *  0(/e) 
and  a  differentiability  argument  such  as  used  below  (5.5),  this  only  alters  the 
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term  by  o(e) .  Since  v  P2 (N  | y) -v  P1 (N  jy)  ■  0,  we  delete  this  part  of  the  result 

*  *  i  i 

ing  summand.  We  now  have 


r~  'Vr  'W  'Vr  r  f 

(6.11b)  T,  +  T,  -  o(e)  +  /e  b(u‘)f  (U  ,ne)]  E,'(u\_-U  )  \  q. 

j  \  Nnuri  ,  r\  r\  4-  1  r»  *-*  *  4 


L2  ‘3 
where  for  j  n+1, 


N  n  u  n  u  n  n+1  n  .  c  ,  "*V 

3»n+l  J 


qj  =  [v2p2(^n+l'  3-n-1fN2|y)-ViPi(X^+i,  j-n-i^ily)  luy(l-y)  =E^+16uJy) 


Finally,  by  the  differentiability  result  of  Theorem  3,  (6.11b)  equals 


(6.12)  T  +  T  =  o(e)  +  ebM(UE)  [b.,(UE)f  <U£,ne)]  V  5*6u  (y)  6u .  (y)  . 
£  J  NnNnun  u.^nn  -i 

3=n+l  J 


The  difference  between  (6.11b)  and  (6.12)  is  simply  due  to  whether  yE  or  y  is 

n  1 

_  'V/f*  n.f 

used  to  get  X  and  U  from  X  and  U  . 

n+l  n+1  n  n 


Part  3.  Now,  we  "average  out"  the  sum  in  (6.12)  . 


Define  fE(ne)  by 


fE(ne)  =  eb  (UE) [b  (UE) f  (UE ,ne) ] 

£  NnNnun  u 


l  l  [iT'fiu  (y)5ir(y)-E<5u.  (y)6u.(y)  ]  . 
j=n  k= j+1  3  3  k 


By  the  (uniform)  geometric  convergence  result  of  Theorem  2,  the  sum  converges 
absolutely  and  If^ne)  |  *  0(e).  By  a  straightforward  calculation  using  the 
stationarity  of  {6u^(y) }  under  P,  we  can  show  that 

&EfE (ne+e) -fE (ne)  -  -  (6.12)  +  o(e) 
n  z  z 

00 

+  ebN(Un) [bN(”n)fu(^n'nc) ]u  I  E(6uQ (y) 6u  (?) ) . 

Finally,  we  treat  the  term  before  the  o(e)  of  (6.7)  -  in  the  form  in 
which  it  is  written  below  (6.7)  .  Define  fE(ne)  by 


(WV 
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f  (UE,ne) 
,t .  .  uu  n 

fQ(ne)  -t - 2 - 


00 

b2<3E>  l  lE^(5u  (y))2-E(6u  (y))2]. 


j=r> 


By  a  procedure  similar  to  that  used  for  f^ne)  ,  it  can  readily  be  shown  that 

z  z  fuu^Un'n£^  2  'V't  —  -  2 

EEf  E ( ne+e ) - f E ( ne )  =  o(c)  +  e  - -  bjj (UE)  E (6uQ (y) )  z 

f  (U£,ne) 

- e  -JSL2S —  bN(un>i^[D,(y)Jin(y)+B(y)J2n(y)1  • 


Summarizing  the  previous  calculations 


EEfe (ne+e) -fE (ne)  =  o(e)  +  ef  (uE,ne)  +  ef  (UE ,ne) GUEb  (UE) 
n  tn  unnNn 


(6.14)  +  efu(UE,ne)bN  (^)bN(U^  \  E  6uQ(?)6u  (y) 

1*1 


f  (UE ,ne) 
uu  n  .  2 ,J*e. 


+  e  — ~ 2n -  bN(UE)  [E(6u0(y) )  +  2  [  E  6u  (?)6u  (£)  ] 

^  1«1  3 


Part  4 .  Conclusion.  Reintroduce  the  superscript  N.  Fix  N.  All  the 
fE,N  are  bounded  and  of  order  0(/e)  and  (UE'N)  =  {uE,N(0)}  is  tight.  Also 
ilE'NfE'N(ne+e)-fE'N(ne)  *  0(e).  Thus,  by  [7],  Theorem  2, 

the  bounded  sequence  {u  '  (•)}  is  tight  in  D[0,») .  Let  e  index  a  weakly  con- 

N 

vergent  subsequence  with  limit  U  (•) .  Since  A  is  defined  to  be  the  infinites¬ 
imal  operator  of  (6.3),  by  (6.14)  and  Theorem  1,  we  see  that  U1^*)  solves  the 

M 

martingale  problem  corresponding  to  an  infinitesimal  operator  A  whose  coeffic- 

f\,c 

ients  equal  those  of  A  in  S^.  Thus,  by  Theorem  1,  {u  (•)}  converges  weakly  to  a 
solution  u(*)  of  (6.3).  The  independence  of  B(.)  and  u(0)is  a  consequence  of 
the  fact  that  o(*)  is  the  unique  solution  to  the  martingale  problem.  The 
stationarity  assertion  is  not  hard  to  prove,  but  we  omit  the  details.  Q.B.D. 


1 
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VII.  ASYMPTOTIC  THEORY  OF  AN  ADAPTIVE  QUANTIZER:  INTRODUCTION 

In  recent  years  there  has  been  a  great  deal  of  effort  concerning  the  effic- 

1  ient  quantization  of  signals  in  telecommunications  systems,  e.g.  of  voice 

signals  in  telephone  transmission  systems.  Let  z(0  denote  the  actual  signal 

process  and  A  a  sampling  interval.  In  the  problem  of  interest,  the  signal  is 

sampled  at  moments  {nA,  n=0,l,...},  then  the  samples  {z(nA) }  are  quantized,  and 

it  is  only  the  quantized  samples  which  are  transmitted.  Let  0  *  <  . . .  < 

£  <  £  =  °° ,  0  =  n.  <  n_  . . .  <  nT  where  £.,  n.,,,  i  =  0,  ...,  L-l,  are  real 

L- 1  L  12  I-i  1  1+1 

numbers.  Let  the  quantization  function  Q(*)  be  defined  as  follows:  there  is 
a  y  >  0  such  that  for  z(nA)  >  0,  Q(z(nA))  =  yru  if  z(nA)  £  [y£^_^,y£^) ,  an(*  set 
Q(-z)  =  -Q(z)  .  The  parameter  y  is  a  scaling  parameter.  As  the  signal  power 
increases  (decreases) ,  y  should  increase  (decrease)  for  efficient  reconstruction 
of  the  signal  from  the  sequence  of  quantizations. 

The  problem  of  choosing  appropriate  values  of  y  when  the  signal  powers  can 
vary  by  an  order  of  magnitude  or  more  has  led  to  the  study  of  adaptive  quan¬ 
tizers.  We  give  only  a  brief  description  in  order  to  formulate  the  problem. 

For  more  detail  and  discussion  of  the  engineering  considerations,  the  reader 
is  referred  to  the  references  [4],  [5],  Let  e  denote  a  "rate  of  adjustment” 
paramett  or  the  scale  parameter  y  and  let  y^  denote  the  value  of  the  adapted 
scale  i  '.neter  at  the  nth  sampling  instant.  Set  6  €  (0,1]  and  let  0  <  < 

M2  <  '  ’ '  <  <  °°  M 

is  a  truncated  form  of  the  (typical  in  such  an  application)  adaptive  system 
(7-l)  y^+1  =  <yn)6Bn'  where  Bn  =  Mi  if  lz(nA)  1  G  tyn5i-l 'ynV  * 


0  0 

<  1,  >  1.  We  study  an  adaptive  quantizer  which 


Goodman  and  Gcrsho  (4]  did  a  thorough  analysis  of  (7.1)  for  the  case  6  • 

1  and  (z(nA)}  independent  and  identically  distributed.  With  $  <  1,  the  syston 
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has  some  desirable  robustness  properties  and  this  case,  together  with  simula¬ 
tions,  is  discussed  by  Mitra  [5]  and  others.  The  last  reference  is  concerned 
more  with  reconstruction  of  the  process  z(*)  from  {Q(z(nA) ) !  and  does  not  give  an 
asymptotic  analysis. 

Generally,  with  non-i.i.d.  {z(nA)},  it  is  hard  to  get  concrete  information 
on  {yE}  for  large  n.  If  the  signal  power  varies  over  time  or  if  (as  is  real¬ 
istic  for  moderate  values  of  A)  {z(nA) }  is  not  i.i.d.,  then  techniques  such 
as  used  in  [4]  fail,  but  for  small  rates  of  adjustment  (e)  an  asymptotic 
analysis  can  still  shed  light  on  the  process  behavior.  At  the  present  time, 
it  seems  that  little  more  can  be  done  for  the  general  case.  Here,  we  scale  the 
problem  so  that  an  asymptotic  analysis  is  possible.  For  mathematical  as  well 
as  practical  purposes,  it  is  useful  to  confine  yE  to  some  finite  positive 
interval  Now,  we  define  the  truncated  form  of  (7.1)  which  will  be 

studied.  Let  a>0,0<ae<l  and  let  {JL}  be  real  numbers  such  that  < 

*,<...<  1.  and  i  <0,  Jl  >0.  Then  we  use 
2.  LI  L 


(7.2) 


n+1 


(yV’^  Be 
n  n 


where  I  denotes  truncation  and 


BE  «  <l+e£.)  if  | z(nA)  j  £  [yE£.  , ,yE£.). 
n  l  11  Jn  i-l  Jn  i 


The  asymptotic  results  can  be  used  to  get  information  on  the  effects  of  the 

A,  structure  of  z(0  and  a  on  the  performance  for  small  e.  For  notational 

convenience  below,  let  <  1  and  yy  >  1,  Rewrite  (7.2)  in  the  form  (7.3),  where 

y1  Cn  -  y[l-ea  log  y]+OU2)  and  (l+ebE)  :  BE  are  used,  and  F  and  be  have  the 

n  n  n 

obvious  definitions. 


-27- 


y  y 

(7.3)  ye  »  lyr  (l+eb1") -tay^  loq  yC+0(t2)  )  U  i y *  +•!■'( y '  ,  z  ( nA ) )  +0  ( i. 2 )  ]  U. 

n+1  n  n  n  n  n  n 

y*  y 9 

In  [4],  the  process  (log  y^}  rather  than  {y^}  is  dealt  with. 

We  proceed  in  very  much  the  same  way  that  we  did  for  the  automata  problem. 
The  main  difference  arises  from  the  unboundedness  of  (z(nA)},  under  assumption 
(7.6) .  By  definition, 

L 

bn  =  f  «-il{  |  z(nA)  |  €  (y^i_1,yn^i) ). 
i=l 

There  are  continuous  functions  i^(-)  such  that  (7.4)  and  the  properties  below 
it  hold. 

(7-4)  yn+l  =  yn(1+e6n(yn1  ]  ~  eayn  log  yn  +  0(r'2) 

=  y^  +  eFe(y^,z(nA) )  +  o(e2) , 

where 

L 

(7.5)  6n(y)  =  E  tj(y)i{|*(n&)|  €  lyC^j.y^)  . 

i»l  x 

E  E 

Also,  A^(')  can  be  chosen  such  that  id*)  *  JL  out  of  an  0(e)  neighborhood  of 
y.  (resp.  y  )  if  t.  <  0  (resp.  1.  >  0),  and  0  >  i€.  iy)  >  l.  for  i.  <  0  and 
0  <  £*(y)  <  l.  for  i.  >  0. 

—  l—i  l 
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Some  assumptions .  For  specificity,  z(-)  is  assumed  to  be  a  stationary 
Gaussian  process  with  a  rational  spectral  density.  Thus  there  are  an  asymptot¬ 
ically  stable  matrix  M,  a  matrix  C,  a  row  vector  D,  and  a  process  v(-)  such  that 

(7.6)  dv  =  Mvdt  +  Cdw 

z  =  Dv,  w(*)  =  vector-valued  standard  Brownian  motion. 


This  assumption  is  not  essential  -  only  certain  smoothness  properties  of 
the  multivariate  density  are  used,  together  with  the  exponential  rate  of 

decrease  of  the  effects  of  the  initial  conditions. 

.  -  2 
Define  F£ (y)  =  EFe<y,z(nA))  and  F(y)  =  EF(y,z(nA)).  Let  oQ  =  var  z(t) . 

We  have  (the  subscript  y  denotes  the  derivative) 


(7.7)  5_(£^) 

dy  y 


i=l 

0 

L-l 


A'iui 


exp 


r2  2 
!iL 
20? 


r2  2 

r  Ci-ly  .  , 

?i_i  exp - ^ — ]  -  a/y 


2o, 


l  Ui_!li+i)ei  exP  "^iy2/2oo  '  a/y 


i=l 


We  can  see  from  the  terms  in  (7.7)  that  F(y)/y  is  the  sum  of  two  strictly  convex 
functions,  the  first  being  bounded  and  having  a  negative  slope,  and  the  second 
going  to  00  as  y  -*•  0  and  to  as  y  ■+  «.  Thus  there  is  a  unique  y  E  (0,<»)  such 

A  v  B  A  A  v 

that  F  (y)  =  0.  Also  f'(y)  >0  for  0  <  y  <  y  and  F(y)  <0  for  y  >  y  and  Fy(y) 

ft  0.  We  assume  that  y  £  •  For  small  e,  the  assertions  in  the  last 

sentence  hold  with  F  replacing  F.  Define  U£  ■  (yt-y)//e  and  let  E  denote 

t:  n  n  n 

expectation  conditioned  on  (v(jA),j<n). 


VIII.  TIGHTNESS  OF  (U  ,  SMALL  C,  LARGE  n) 
_ n _ 

The  proof  is  similar  to  that  of  Theorem  4  in  Section  V  and  we  only  set 

it  up  and  indicate  how  to  deal  with  the  fact  that  Iz(nA) }  is  unbounded. 


Theorem  6.  Under  the  conditions  in  Section  VII,  the  conclusions  of  Theorem  4 
hold. 


Proof.  Define  V(y)  *  (y— y )  .  There  is  a  y  >  0  such  that  (y-y)F(y)  <  -yv(y), 
all  e  >  0  and  y  G[y^,yu].  We  have 


,  e  e,  2  2, 

(yn+ryn}  =  °(E  > 


yn+l  =  yn+E^e  lyn]  +e  lFc  (yn ' z(nA) >  "yE  ^  J+°(e2)  ' 


F£(y)  -y l  (y) P(yC^_1^| z(nA) |<y£^}  -  o(y  log  y) , 

i=«l 

(8 .1) 

L 

EE(yE+i-yE)  =  cT^y^)  +  e  yE  l  a1,  (y^)  [Plyf.._1<|z(nA)  |<yti|v(nA-A)  ) 


-  P(y  z(nA)  |  <y  £^}J  +  0(s.) 

A  y-yl 


As  done  in  connection  with  (5.2)  (where  ae,  Be  were  replaced  by  a, 8) f  we  get  an 
upper  bound  for  the  second  moment  by  replacing  fc^(yE)  by  V  (hence  F£  by  £)  .  Thus 


(8.2)  EEV(y€  ) -V(yE)  <  0(e2)  +  2e (yE-y)F(yE) 
n  n+i  n  —  n  n 


+  2(yE-y)  [sum  in  (8.1)  with  ?.E(')  replaced  by 


Next,  define  V^(n)  by  V^n)  ■  vi(n'yn>*  where 

«  L 

(8.3)  V^(n,y)  *  2e  (y-y)  \  I  \  y^  (PlyS^^J  z(  jA)  |  |  v(nA-A) } 

j*n  i=l 

'  p{y^i_iilz(nA) ! <yc1>i . 

|v^(n) |  can  be  estimated  by  use  of  the  following  fact.  There  are  <  *  and 

a  >  0  such  that  |eMt|  <_  KQe  at.  There  is  an  a^  >  0  and  a  <  “  such  .that  for 

-at  /2 

X2  >  T1  >  0  and  on  the  set  (v(t) :  | v(t) |e  £1} , 

-at 

(8.4)  |p{v(t+Ti)G  Bi#  i=l , 2 1 v(t) }  -  Pfvft+T.ieB.,  i*=l,2}|  <  l^e  1 
for  all  B^,  B^ . 

In  order  to  use  (8.4)  (in  this  application  we  set  B *  range  space  of  v(t)), 
write  the  sum  in  (8.3)  as 

H  00 

(8.5)  l  +  l  , 
j=n  j=H+l 

where  H  =  min(m:  e  ^m-n^ | v(nA-A)  j  <^l)  =  0(l+max(0,  log|  v(nA-A)  | ) )  .  Then 
the  first  sum  in  (8.5)  is  0(l+max(0,  log|v(nA-A) | ) ) ,  and  the  second  is  0(1)  by 
(8.4)  and  the  summability  of  ^>Qexp  -a^jA.  Thus  |v^(n)|  = 

0(e)  (l+max(0,  log) v(nA-A)  | )  ]  0(e)  (1+j v(nA-A)  | )  .  From  this  point  on,  the 


proof  is  exactly  the  same  as  that  for  Theorem  4. 


Q.E.D. 
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IX.  THE  LIMIT  THEOREM 

We  continue  to  use  the  tilde  terminology  of  Section  VI,  and  define  Un>  yn, 
£e,  etc.,  as  there.  Also,  set  z(nA)  *>  z (n  A+ni)  and  v(nA)  -  v (n^A+nA) .  The  idea 

now  is  still  to  prove  weak  convergence  of  UE(*).  We  use  Er  for  expectation  condi' 

%e,N  ,  ^e,N  >eiN  -  .r~. 
tional  on  {v(jA),  j<n+n£}.  We  have  <<9.1b)  defines  yn  by  *  (yn  -y)//e) 


(9.1a)  U£  =  UE  +  /eF  (yE)  +  </c(?  (yE,z(nA))-F  (yE) )  +  0(e3//2)  , 
n+i  n  e  n  c  n  e  n 


(9.1b)  UE'"  =  UE,N  +  /e[F  (yE'N)  +  (F  (yE,N,z(nA))  -  F  (yE'N))  +  0<e3/2).]b  (3E'N) 
n+1  n  e'n  e  n  e  'n  Nn 


Theorem  7 .  Under  the  conditions  of  Section  VII,  the  conclusions  of  Theorem  5 


hold,  but  where  G  =  F^(y)  and  (stationary  process  z(-)  used) 


a2  =  EF2 (y , z(0) )  +  2  l  EF(y,z(nA) )F(y,z(0) ) . 

n*l 


Remark .  If  M,  C  or  D  were  time-varying,  then  an  extension  of  the  tech¬ 
nique  is  possible,  provided  that  the  time  variation  per  step  is  O(e) .  The  limit 

diffusion  yields  information  on  the  dependence  of  the  performance  on  the  para¬ 
meters  a,  {t^},  A,  as  well  as  an  estimate  of  the  asymptotic  variance  and 

correlation  function  for  small  e. 

« 

Proof .  Except  for  the  unboundedness  of  the  noise  (z(nA)},  the  proof 
would  be  essentially  the  same  as  that  of  Theorem  5,  and  only  an  outline  will 
be  given.  ' 

I N i  * 

Owing  to  the  truncation  |u  '  |  <  N+1,  the  F_,  F^  in  (9.1b)  can  be  replaced  by 

n  e  t 

F  and  F,  respectively,  without  changing  the  values,  for  small  c.  Let  us  make 

■  *2  3 

the  replacement.  Fix  f  (*,•)€:  '  .  Drop  the  superscript  N  on  all  variables 

for  no tat Iona 1  convenience,  as  done  in  Theorem  5.  Then,  by  a  Taylor  expansion. 
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(9.2)  Eef(Ue^,  ,ne+t)-f(Ut,nk:)  -  o(e)+ef  <ue,nt)+ef  (UL,ne)F  (9)Ueb„(UE) 

n  n+1  n  tn  u  n  ynNn 

+  Jcf  (U£,ne)E£  (F(y+i^eUE,z(nA)  J-Fty+ZcU^)  )b  (UG) 
u  n  n  '  n  J  n  N  n 

+  |  fuu(UG,ne)EE[F{y+/eUE,z(nA))-F(y+/£UG) )2bJ(UE)) . 

Since  the  second  derivative  of  EnF(y,z(nA))  with  respect  to  y  is  bounded  by 
constant (1+ I v(nA-A) |],  the  next-to-last  term  of  (9.2)  can  be  written  as 

(9.3)  /ef  (U£,ne)Ee[F(y,z(nA))-F(y) ]b„(UE) 

u  n  n  n  n 

+  ef  (uE,ne)-|~£E [F(y, z(nA) )-F(y) )  UEb  (UE) +o (e)  [1+ 1 v(nA-A)  |  ] . 
u  n  3y  n  J  1  -  n  N  n 

y-y 

The  last  term  of  (9.2)  can  be  written  as  l recall  that  F(y)  *  0) 

(9.4)  f  (Ue,ne)Ee [F(y,z(nA) )-F(y) l2b2 (UE)  +  o(e) . 

2  uu  n  n  N  n 


Now,  we  use  the  method  of  Theorem  5  in  order  to  average  out  the  terms  of 
(9.2).  We  use  fE(ne)  =  f(UE,ne)+  £?=3fE(ne).  Define  fE(ne)  by  (to  average 
out  the  second  term  of  (9.3)) 


fE(ne)  =  efu(UE,ne)bN(UE)UE  £  g-  EE [F(y,z( jA) )-F(y) ] 

j=n  y 


By  an  argument  similar  to  that  used  below  (8,5),  together  with  the  derivative 

bound  stated  above  (9.3),  it  can  be  shown  that  ECfE (nc+e)-fE(ne)  *  -  (second 

no  3 

term  of  (9.3))  ♦o(i)|l  ♦  |v(nA-A)|2]  and  that  |fE(nc)|  <  0(e) [1  +  |v(nA-A)|). 
Next,  introduce  f'  (n» )  (to  avuiagc  out  (9,4)): 
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f£(ne>  *  2  fuu<^n'nC)bN(^n)  E  (E^F2 (y , z( jA) ) -EF2 <y , z( jA) ) ) . 

j=n 

Then,  as  for  f£,  we  have  |f£(ne)i  <_  0(e)  11  +  |v  nA-A)|).  Using  this,  it  is  not 
hard  to  show  via  a  small  amount  of  manipulation  that 


Ef  (L.  (ne+e)-f‘  (m  )  *  - 
n  4  n 


f  w'  .mlbJu")  l£LF2(y,z(nA))-EF2(y,z(nA))  1 

?  uu  n  N  n  n 


+  o(e)  U  +  j v(nA-A) | 1 


Next,  introduce  f5 


fe(ne)  in  order  to  average  out  the  first  term  of  (9.3): 


f£(ne>  =  ^  f  (uSne)b  <U£)  \  EnF(9,z( jA> > . 
5  j»n 


Then,  again.  |f£(ne)|  *  o(/F) (1  +  |v(nA-A)|)  and  we  can  write 


(9.5a)  Eef£(ne+e)-f£(ne) 
no  p 


■  -  (first  term  of  (9.3) 


*  t*X(3n,l-"e’V6n,l,-VV“V3n’1,  I  **'<**»» 

j=n+l 


With  a  small  amount  of  manipulation,  we  can  show  that  the  last  term  of  (9.5a)  equals 


(9.5b) 


tfu(n,n£)bN(  n)3u  E  E£F(y,z(  jA)  )F(y,z(nA)  )+o(e)  1 1-f  |  v  ( nA-A)  |  J . 
u  j»n+l 


Finally,  f^int)  is  introduced  in  order  to  avoraqo  out  the  sum  term  in  (9.5b)  in 
the  same  way  that  f^(nt)  was  used  to  average  out  (6.12)  in  Theorem  5.  Define 
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(9.6)  fE (ne)  =  e  (f  (UE,ne)bM(UE)  ]  bM(lJe) 
6  un  NnuNn 


‘  1  1  (Rl'F(y.z(kA)  )F(y,z(  jA)  )-EP(y,z(kA) )  F(y,z(  jA) )  )  . 

j=n  k=j+l  n 


By  (8.4),  fE(nt)  is  well  defined  and  is  0(c) [1  +  |v(nA-A)|2],  as  will  now  be 

V> 

proved . 

c 

Define  H  as  below  (8.5)  and  let  E  B  denote  the  (j,k)th  summand  in  (9.6) 

n  jK  - 

and  write  the  sum  in  (9.6)  as 


I  I  +  I  I  e‘bE.  1  +  II. 

j«n  k«j+l  n  3k  j=A+l  k=j+l  n  3k 


By  the  argument  connected  with  (8.5) ,  the  inner  sum  in  I  is  bounded  by 


(9.7)  l 
k*  j+1 


k=  j+1 


E  E.B.J 
n1  3  ]k 


=  0(1+  I  v(nA-A)  I )  . 


Thus,  by  the  bound  on  (H-n),  i  <.  o(l  +  Jv(nA-A)  |  2)  .  To  treat  II,  we  note  the 
following:  there  is  a  ®  such  that  for  H  <  j  <  k,  | |  <_  K2  exp  -a^fj-nJA 

Also,  for  k  >  j. 


£EF(y, z(kA) )  ^K2(exp  -a^^ (k-  j) A+l{  |  (exp  -a(k-j) A) v(  jA-A)  |  >  1 )  ] . 

With  a  little  more  work,  these  estimates  yield  the  existence  of  a  <  ®  such 

that  |£fEBj.  I  <  (l+0(  |  v(nA-A)  | )  K  exp  -a.  A  { ( j-n)  +  (k- j)  )/2 ,  from  which  the  fact 
n  jk  —  j  i 

that  II  *  0(1)  and  the  last  sentence  of  the  previous  paragraph  both  follow. 


It  is  straightforward  to  show  that 


-35- 


f£  (nt  +»  ) -f'  (nr)  »  -(sum  term  in  (0.5)) 
n  b  C> 


+  r_bN(uS  [  f  u  ( ,  nt  >  bN  ( )  ]u  l  KF(y.z(nA))F(y,z(0))  +  o(t)  [1+  |  v(nA-A)  |‘]  . 

n=l 

,  c  'M  r-6  l 

Summarizing,  with  f  (nt)  defined  by  f  (nt)  f(U^,n»)  +  i^-jf^lm),  we 

have 


(9.8) 


&k  f k  (nt+t )  -  f k  (nr) 


=  o(t')  11+  |  v(nA-A)  I  2)  + 


tf  (U*  , nt)  +  ef  (U',ni)F  (y)ULb„(UC) 
tn  u  n  y  n  N  n 


+  eb  (UE) [f  (UL ,nc)b 
N  n  u  n 


'XjF 

.  .yt .  fuu(Un,ne) 
+  ebN  UN  2 


U  )  7  EF(y,z(nA) )F(y,z(0) ) 

N  n  u  , 
n=l 

EF2 (y , z(0) ) +o(e)  [1+ | v(nA-A) | 2)  . 


Now,  if  the  {U£,N(‘))  (returning  to  the  use  of  superscript  N)  were  tight 
for  each  N,  then  (9.8)  and  Theorem  1  imply  that  any  weakly  convergent  sub¬ 
sequence  of  {ul,N(-)}  converges  to  a  diffusion  with  operator  AN,  whose  coeffic- 

'U  j 

ients  equal  those  of  A  in  S.,  and,  hence,  that  the  oriqinal  (U(-))  converge 

N 

weakly  to  the  solution  of  (6.3)  with  the  G  and  o  defined  in  Theorem  7. 

But  (dropping  the  superscript  N  again)  [ X ^ ( r»t_ ) j  =  0(/l)  ll+|v(nA-A)  j  ] 

and  |Eefe(ne+e)-fe(ne) |  =  0(e)  +  o(e) [l+(v(nA-A) | 2]  and  for  any  T  <  00  , 
n 

K  >  0,  the  Gaussian  property  implies  that 


lim  p{supr(v(ne)|2  £  K}  =  0. 
r-*0  n<T/r 


Thus,  tightness  follows  by  Theorem  7  of  [11  or  171,  an  it  did  for  the  case  of 


Theorem  1 • 


£> .  E .  D . 
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ANALYSIS  OF  NONLINEAR  STOCHASTIC  SYSTEMS  WITH  WIDE-BAND  INPUTS 
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Abstract 

A  general  method  for  approximating  systems  with  wide-band  inputs  by  diffusion 
processes  is  discussed.  The  input  noise  itself  can  be  processed  nonlinearly  by 
elements  of  the  system.  One  particular  case  is  examined  in  detail,  that  of  a 
hard  limiter  just  after  the  input.  The  results  suggest  that  the  limiter  can 
actually  improve  performance  if  the  noise  intensity  is  small.  The  development 
illustrates  how  tricky,  but  potentially  useful,  nonlinearities  can  be  handled 
when  the  system  input  is  wide-band  noise. 
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I.  Introduction 

Consider  a  (linear  or  nonlinear)  dynamical  system  with  a  wide-band  noise 
input.  It  is  often  of  considerable  interest  to  approximate  such  systems  by 
diffusion  models  so  that,  e.g.,  Markov  process  techniques  can  be  used.  In  II)  - 
[4],  17],  several  [owerful  methods  for  doing  this  have  been  developed. 

Roughly,  the  input  noise  process  is  parametrized  by  t  and  as  i  ■*  0,  the  bandwidth 
(BW)  -*•  ,  while  the  power  per  unit  BW  converges  to  a  constant.  The  limit  process 

is  found  via  methods  of  weak  convergence  theory.  The  methods  are  particularly 
useful  when  the  system  noise  (and/or  signal)  is  processed  nonl inearly ;  i.e., 
only  nonlinear  functions  of  the  noise  appear  in  the  dynamics.  Tie  problem  is 
often  not  what  the  so-called  correction  term  might  be,  but  what  the  entire 
form  of  the  limit  is,  and  this  is  not  usually  easy.  In  fact,  when  nonlinear 

functions  of  the  noise  appear,  the  notion  of  "correction  term"  loses  much  of  its  sense 
In  this  paper,  the  system  of  Figure  1  is  dealt  with. 

vE  =  F(ve)  +  Dy\ 

(1.1) 

e  ,  •  e  £  e  c.  e ,  .  r  „r 

y  =  Le  sign  u  ,  u  =  s  +  n  -  G(v  ) ,  v  (t)  t  R  , 

where  n£(-)  is  a  scalar-valued  wide-band  noise  input  process.  Conditions 

£ 

on  F ( • ) ,  n  (•)  and  G(-)  will  be  given  below.  The  main  result  is  that  as  t  +  0 
(BW  -*■  °»)  ,  the  measures  of  {vC(-)}  converge  to  those  of  v(-)  where  v(0  satisfies 
the  It6  equation 

(1.2)  dv  =  F(v)  dt  +  LD[(S~G(v)  )  (/2A)  dt  +  /2  In  2/a  dB], 

o 

where  B(-)  is  a  standard  Brownian  motion  and  L,  a,  a  will  be  defined  below. 

Roughly,  "a"  is  related  to  the  correlation  function  of  the  n£ ( • ) / (E | n£ ( t) | ^) , 

2  G 

and  a  is  the  intensity  of  the  spectrum  of  n  (•)  in  any  band  [0,BW]  for  small  r. 


If,  in  the  system  of  Fig.  1,  the  saturator  and  gain  I,  were  replaced  by  a 
gain  K,  then  the  limit  would  be  (follows  from  the  method  of  this  paper) 

(1.3)  dv  =  F(v)dt  4  KD  [  (s-G(v) ) df  4  /2H  db  ]  , 

2 

where  for  the  noise  model  used  for  (]'.i),  R  =  a  /a.  There  is  no  so-called 
"correction  term".  Owing  to  the  form  of  the  saturator  function,  the  formal 
technique  of  Stratonovich  is  inapplicable. 

The  example  is  offered  to  illustrate  what  can  be  done  with  one  particularly 
annoying  but  useful  nonlinearity.  The  basic  method  is  widely  applicable.  The 
scheme  is  unrelated  to  statistical  linearization,  which  in  fact  is  not  con¬ 
cerned  directly  with  approximating  processes. 

Before  proceeding.,  compare  (1.2)  and  (1.3)  for  the  case  when  the  feedback 
-G ( • )  is  supposed  to  be  stabilizing  (i.e.,  when  the  system  is  designed  to  make 
the  error  s ( t) -G (v£ ( t) )  small.  In  (1.2),  the  term  in  the  dynamics  which 
involves  the  error  is  proportional  to  1/0 ,  and  in  (1.3),  the  noise  term  is 
proportional  to  o.  Thus  for  small  o,  we  expect  the  limiter  to  enhance  stability 

without  increasing  the  noise  effects,  an  important  point  to  note.  For  large  o, 
the  limiter  does  not  seem  to  be  helpful.  A  simulation  comparison  of  the  "pre-limit” 
with  the  limit  for  a  somewhat  different  problem  (a  phase- locked  loop  with  a  satur¬ 
ator)  suggests  that  the  limit  (e  -+■  0)  results  are  often  the  "worst"  case,  in  that 
(for  example)  the  limit  mean  square  error  often  increased  to  the  limit  value  as 
e  -«•  0.  (They  also  suggest  that  often  the  limit  process  is  approached  quite  fast 
(as  measured,  say,  by  the  mean  square  value  of  the  input  to  the  limiter)  as  e  ■+  0.) 

He  do  not  know  the  extent  of  applicability  of  this  rule  -  but  it  seems  to  hold 
frequently.  When  it  does  hold,  the  limit  results  can  provide  useful  upper  bounds, 
and  system  improvements  suggested  by  the  form  of  the  limit  might  well  be  improvements 


for  the  “pre-limit"  case  also.  Unfortunately,  it  is  not  usually  possible  to  get 
approximate  diffusion  processes  where  the  BW  is  not  largo  -  so,  even  if  it  is  not 
large,  the  results  for  large  BW  might  bo  a  useful  guide  to  the  qualitative  behavior 


Reference  [5]  contains  some  applications  to  problems  in  communications 
theory  of  the  same  general  idea.  But  owinu  to  the  unbounded  nature  of  the  noise 
and  the  form  of  the  discontinuity  and  feedback,  the  problem  here  is  harder 
and  the  analytical  details  different. 

Section  II  gives  specific  assumptions.  The  main  background  theorem  and 
some  comments  on  weak  convergence  appear  in  Section  III,  and  the  convergence 
{ve(-)  }  -*■  v(-)  is  proved  in  Section  IV.  A  similar  method  would  be  used  with 
other  nonlinearities. 


II.  Model  Assumptions 

1.  The  noise  model.  Let  z(*)  denote  a  stationary  Gaussian  process  with 

2  |  1  c  £ 

correlation  function  a  exp  -a|T|,  a  >  0,  and  set  n  (t)  =  z  (t)/e,  where 

£  2  £  2 
z  (t)  =  z(t/e  ).  As  e  -»  0,  the  spectral  density  of  n  (•)  converges  to  2 a  /a 

on  any  finite  interval.  The  scaling  is  a  convenient  and  common  way  of  getting 

a  noise  process  n  (•)  whose  spectrum  converges  (as  e  -*•  0)  to  that  of  a  white 

noise  with  a  constant  power/unit  bandwidth.  For  other  correlation  functions 

the  /2  In  2  in  (1.2)  is  replaced  by  something  slightly  different.  We  use 

the  noise  form  only  to  facilitate  the  evaluation  of  the  coefficient  of  dB(*) 

in  (1.2).  The  Gaussian  assumption  simplifies  the  proof  that  certain  integrals 

converge  -  but  is  not  essential . 

2 .  The  limiter  gain  L  . .  If  l  ■'  L,  a  number  not  depending  on  e,  then 

as  r.  ►  0,  the  "increased  wildness"  of  n'  (-)  essentially  wipes  out  the  saturator  - 


replacing  it  by  an  open  circuit.  Thus  must  increase  as  e  decreases.  In 

any  particular  fixed  practical  system,  one  particular  value  of  will  be  used. 

But  as  the  bandwidth  -+■  this  value  of  L^  will  have  to  increase  (see  proof  in 

Section  IV)  and  eL^  will  have  to  converge  to  a  non- zero  number.  So  we  use 

L  =  L/c. 

£ 

3.  Other  assumptions.  G(-),  F(-)  are  continuously  differentiable  and 
the  solution  to  (1.2)  is  unique  in  the  weak  sense.  s(‘)  is  right  continuous 
and  uniformly  bounded  on  (0,®) .  The  method  is  most  easy  to  use  if  the  functions 
are  smooth .  The  analysis  will  be  done  with  g^ ( • )  replacing  g ( • )  =  sat ( • ) , 
where  the  piecewise  linear  ga(-)  is  defined  in  Fig.  2.  We  then  get  the  result 
(vG(*)}  v(-)  as  £  +  0,  then  a  ->•  0. 


III.  Weak  Convergence;  A  Convergence  Theorem 

Tightness .  Let  Dr[0,«)  denote  the  space  of  Rr-valued  functions  on  [0,®) 

which  are  right  continuous  and  have  left-hand  limits.  A  certain  topology 

called  the  Skorokhod  topology  ([6],  section  14)  is  usually  put  on  Dr.  The 

process  v  (•)  is  considered  to  be  a  random  variable  with  values  in  Dr[0,»)  and 

induces  a  measure  P£  on  it.  (P£}  or  {vE(*)}  is  said  to  be  tight  iff  for 
'  r* 

each  6  >  0  there  is  a  compact  K. £  D  [0,®)  such  that  P  (K.)  >  1-6,  all  e. 

o  e  o  — 

£  y, 

{v  (•)}  is  said  to  converge  weakly  to  a  process  v(')  with  paths  in  Dr[0,®)  and 
inducing  measure  P  on  it  iff  for  each  bounded  real-valued  function  g(*)  on 
Dr[0,<*>),  / q(u))dP£(u>)  -*•  / q(uj)dp(!i>)  as  e  0 .  Thus  weak  convergence  is  a 
generalization  of  convergence  in  distribution.  It  is  the  appropriate  form  of 
convergence  for  our  problem.  The  tightness  condition  for  {ve(0>  will  hold 


under  our  assumptions. 
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Truncated  processes.  The  actual  technical  proofs  of  tightness  and  weak 
convergence  are  easier  if  the  processes  (vL(-))  are  bounded.  Define 


...  .e,N  e,N.  e,N..  .  e,N. 

(3.1)  v  =  lF(v  )+Dy  ]b  (v  ), 

N 


t.  t  fN. 

y  =  L(  ga(u  )  , 


e,N  i  e,N. 

u  =  s  +  n  -  G(v  ) , 


where  b  (v)  =  1  for  v  g  S  =  {v:  |v|<n),  b  (v)  =  0  for  v  ^  Sk,  .  and  b„(v)  (E  [0,1) 
and  has  third  derivatives  that  are  bounded  uniformly  in  v  and  N.  If  we  can 
prove  convergence  for  {vE’N(0,  e-K)}  for  each  N,  then  Theorem  1  says  that  we 
can  prove  it  for  (1.1).  Thus,  the  truncation  is  purely  technical  and  does  not 
affect  the  result. 

Definitions.  Let  A  denote  the  infinitesimal  operator  of  the  diffusion  (1.2). 
Let5*^'N  denote  the  a-algebra  induced  by  {vE,N(s)  ,nE  fs)  ,s<t)  and  eE,N  the  cor¬ 
responding  conditional  expectation.  Actually and  EE,N  depend  on  a  also. 

But  we  usually  suppress  the  a  affix.  Let  be  the  class  of  measurable  (w,t) 

functions  such  that  if  g(*  )G  then  Ejg(t+6)-g(t) |  +  0  as  i  H  and 

sup  E | g ( t)  |  <  00  and  g(t)  depends  only  on  {vE  ,N(s)  ,nE  (s)  ,s<t} .  We  say  p-lim^^  f^  =» 

0  iff  sup.  Ctl  1  4L and  E|f*^(t)  |  +  0  as  {  +  0.  Define  an  operator  AE,N  and  its 
u ,  t 

domain  i^(AE'N)  as  follows:  g£  .<S?(AE,N)  and  AE,Ng  =  q  iff  g^Gi#  and 
EE,Ng(t+6)-g(t) 

p-lim  e|— = - - -  -  q(t)  I  =  0. 

6^0  6 

The  following  theorem  is  Theorem  1  of  [21,  adapted  to  our  case.  i?Q  denotes 
the  set  of  continuous  real-valued  functions  on  Rr  *  10,®) . 
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Theorem  1.  Let  the  equation  (1.2)  have  a  unique  weak-sense  solution.  Fix  N. 
For  each  f ( • )  €  &  ,  a  dense  set  (sup  norm)  in  ,  let  there  be  a  sequence 
(fe 'N( * ) } £  satisfying  the  following: 


(3.2) 


(3.3) 


p-lim  |fC,N(t)-f(v‘-'N(t)  ,t)  |  0, 

e-K) 

(V*0 


fc'V)  £  ^(At,N)  , 


e,N. 


(3.4)  p-lim  |Ae'Nfe,N(t)-(AN-+|-)  f(vE  ,N(t)  ,t)  |  =  0, 
e-K)  3t 

a-K) 

.N  . 


where  A  is  the  infinitesimal  operator  of  some  diffusion  process  and  the  coeffic¬ 
ients  of  AN  and  A  are  equal  for  v  £  .  Then  if  { vE ' N ( • ) }  is  tight  for  each  N , 

(vE(*)}  -*■  v(-)  weakly . 


Comment .  Tightness  is  not  hard  to  prove  here.  See  comments  at  the  end 
of  the  proof  of  Theorem  2,  which  applies  Theorem  1  to  our  case  (1.1) .  Given 
f(*,*),  the  main  problem  is  to  find  the  fe'N(*)  and  to  verify  (3.2)  -  (3.4) 

(and  ultimately  to  prove  tightness).  The  method  used  here  and  in  fl],  [2 ]  is 
similar  to  the  averaging  method  used  in  (3).  We  choose  the  form  fe,N(t)  * 
f(ve,N(t),t)  +  fE,N(t)  +  fE,N(t),  where  fE,N(t)  is  chosen  so  as  to  "average  out" 
certain  noise-dependent  terms  in  A  '  f(v  '  (t),t),  and  f  '  (t)  is  chosen  to 

A  £  N 

"average  out"  certain  noise-dependent  terms  which  result  from  applying  A  ' 

to  f  '  (t) .  In  the  proof  lim  .  means  lim  _  lim  .. 

1  e-K)  e-K)  a-K) 

a-K) 
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IV.  The  Convergence  Theorem 


Theorem  2 .  Under  the  assumptions  in  Section  II,  { vL ( • ) }  converges  weakly  to 
v  ( • )  as  e  •*  0  and  then  it  -»  0 . 

Proof .  Let  ^  =  %  Q'  >  the  subspace  of  of  functions  whoso  mixed  i>artial 
derivatives  up  to  order  2  in  t  and  3  in  x  are  continuous.  By  Theorem  1,  for 

each  N  and  f(-,*)£  ,  we  only  need  to  find  {fL'N(-)}  satisfying  (3.2)  -  (3.4). 

t  N  e 

For  notational  convenience  write  v  '  ( ■ )  as  v  ( • )  in  this  proof,  but  we  are  always 

C  N 

working  with  the  truncated  process  v  '  (*). 

Part  1.  Fix  f(*,*)  E  .  Then 

(4.1)  AC,Nf(vE(t) ,t)  -  ft(vC(t),t)  + 

+  b„(v€(t))f'(ve(t),t)  •  [F  (vE  (t) )  +  —  g  (s(t)+ne(t)-G(ve(t)))]. 

N  v  E  a 

Note  that  for  u  in  any  bounded  set 

(4.2)  ^£g  (u+nE (t) )  gE(u)/e  =  ^-[P{z(0)  >-eu+ea}-p{z(0)  <-eu-ea}  +  0(ea)/e] 

=  /T/ii  ~  +  0(a)  +  0(c) , 

€  N 

which  justifies  the  L£  =  L/e  scaling.  We  will  get  f  '  (•)  in  the  form 

fE'N(t)  -  f(ve(t),t)  +  fE'N(t)  +  fE,N(t), 

c  N 

where  the  f  '  (•)  will  be  defined  below. 
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The  following  estimate  will  be  used. 

(4.3)  On  the  set  {|z(t)|<l}  or  even  on  { |  z(t)  |^eal//2} , 

-at 

|p{z(t+t)  C  B  |  z(t) }  -  P { z ( t+T )  (  B}|  ^  Ce  for  some  constants  C  and 
a^  >  0,  uniformly  in  B.  Similarly,  on  the  same  z(t)  set  and  for 

T2  "  T1  >  °' 

~>*.T 

Ipfzft+tJC  B^,  i=l,2|z(t)}  -  P{z(ti+t)r  i*l,2l|<Ce  for  some 

al  >  0  and  c  <  “>  and  all  B  ,  B2> 


In  the  sequel  the  values  of  a^  and  C  may  change  from  usage  to  usage. 

—  £  £  £ 

Define  g  (u,n  (t))  =  g  (u+n  (t))  -  Eg  (u+n  (t))  and  define 
a  u  a 

00 

f£,N(t)  =  -  b„(vc(t)) [ f ■ (ve(t) ,t+T)DE^'N5  (s(t+T)-G(ve(t)) ,ne(t+T))dT 
1  e  N  J  v  t  "’a 

0 

00 

=  eLb  (vG  (t) )  [f  •  (ve(t)  ,t+e2T)DE^'Ng  (s (t+r2x) -G(v£  (t)  ),z(^-+x) /e)  d t 

N  i  V  t  01  Z 

o  e 

By  (4.3),  f£,N(t)  =  0(e)  uniformly  in  w,  a, on  the  set  {  |  z ( t/ e 2 )  |  <^1 } . 
Define  =  min{T:  e  aT^2 1  z(t/e2)  |<^1} .  Write  f£'N(-)  as 

w^  « 

f^,N(t)  *  ej  E£,N{-}  dt  +  ej  E£,N{.}  dx • 

0  w^ 

The  first  term  is  bounded  in  absolute  value  by  eCw^^  and  the  integrand  of  the 
second  by  C  exp  -a^x.  Thus 


(4.4)  |  f£  ,N(t)  |  <  Ce  (1+w^)  <_  Ce  (1  +  max(0,log  j  z£  (t)  | )  ) . 


I 


J 
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Part  2.  It  can  be  verified  that  fE,N(*)  £  ^(ACrN)  and  that 


(4.5)  AE'Nf^'N(t) 


Lb  (v  (t) ) 

— — - f  Mvl  *  t)Dg^  (s  (t)-G(vE  (t)),nE  (t) ) 


—  [e*  ,N[bM(vE (t)) f ’ (v*  (t) ,t+x)Dg  (s (t+l ) -G(v‘  ( t) ) .n*  (t+x) ) ]  vl(t), 
i  v 


where  the  subscript* v  denotes  the  gradient  of  the  bracketed  expression  with 

respect  to  vE(t).  At  this  point,  let  us  simplify  the  notation  by  dropping  the 

G  N  G 

b  (v)  terms.  All  of  the  f .  '  will  be  proportional  to  either  b  (v  (t))  or 
N  -  i  N 

2  c  2 

b  (v  (t)).  Changing  variables  t/c  ■*  x  and  splitting  the  integral  in  (4.5) 

N 

*  G  G  G 

into  two  parts  and  using  v  =  b  (v  ) [s-G(v  ) +Dg  /e]  (but  dropping  the  b  (v) ) 

N  01  N 

yields 


(4.6)  L2jEE'N(D,fv(vE(t)  ,t+e2T))^Dgn(s(t+e2T)-G(vt'(t))  ,z(^+t)/c)  • 

0  E 

g  (s(t)-G(vE (t) )+z(t/c2)/e)dx  +  0(c) 
a 

00 

+  L2|EE'N(D’f  (ve(t)  ,t+e2T))i’  (s(t+e2x)-G(vC  (t)  ,z(~H-x)/e)D  • 
J  t  v  (*,v  2 

0  E 
ga(s(t) -G(vE (t) )+z(t/e2) /e) dx  +  0(e). 


The  terms  in  (4.6)  exist  by  the  same  arguments  which  led  to  (4.4).  We 

next  show  that  the  second  integral  of  (4.6)  is  negligible  as  £  +  0,  o  +  0  and 

get  an  estimate  which  is  useful  for  the  tightness  argument.  The  facts  that 

s(t)  and  vC(t)  are  bounded  (recall  that  we  are  using  the  truncated  process 

(3.1))  and  that  the  support  of  g  (u)  is*  in  (-a, a)  and  that  |g  (u)  |  <  C/a 

f  11  Cl  f  u 

*go  is  the  derivative  of  9a(*J  with  respect  to  its  argument.  The  subscript  v 

denotes  the  derivative  with  respect  to  the  explicit  argument  vs  replace  vC(t>>*Vtake 
the  derivative  with  respect  to  v,  then  set  v  •  vE(t).  j 

J 


will  be  used  frequently  and  perhaps  without  specific  mention.  Let  i{a)  denote 
the  indicator  function  of  the  set  A. 


By  (4.3)  it  can  be  verified  that 

(4.7)  Y  =  |E^'Ng^  v<s-G(v)  ,z(^+r)/(  )  |  <  iexp  -a^r  +  I  (  j  z  ( t/r  >eaT^2 )  )C/t . 

'  t 

We  need  a  bound  on  Y  which  goes  to  zero  as  e  -*■  0.  First  we  get  such  a  bound 
when  |z(t/e  )  |  >.  1.  Note  that 

(4.8)  P{  |s-G(v)+z(^-*T)/e|  £  a  z(t/e2) =zQ)  =  O(ut) 

uniformly  for  | zQ |  _>  1  and  T  >  0  (recall  that  s,  v  are  in  a  bounded  set  - 

for  each  N) .  Now,  (4.8)  and  the  facts  cited  above  (4.7)  imply  that  Y  is  bounded 

2  2 

by  0(e)  ,  uniformly  in  |  z(t/e  )  |  >_  1,  x  >  0.  Thus  on  |  z(t/e  )  |  _>  1 , 

(4.9)  Y  _<  [exp  -a^T  +  l{  |  z(t/e2)  |  >eaT//2}  ]1//2C(c/ct) 1^2 

(use  | x |  <_  a,  |x|  <_  b  |x|  £  ^ab)  .  Thus,  on  integrating  the  bound  when 
2 

|z(t/e  ) |  >_  1,  we  see  that  the  second  term  of  (4.6)  is  bounded  above  by 


C [1  +  max(0,  log | z(t/e2) | ) ]  (t/a)1^2. 

2 

Now,  we  look  for  a  bound  when  |z(t/e  ) |  <  1.  Split  the  second  integral  in 

e  * 

(4.6)  into  the  two  parts  /  +  /.  The  first  part  is  0(e/o) .  Note  that  the 

t  °  E  2 

density  of  z(— j*x),  x  >  e,  conditioned  on  any  value  of  |z(t/e  j|  in  [0,1),  is 

bounded  above  by  0(l//c) .  So  (4.8)  then  holds  with  0(ea)  replaced  by  0(e*^2a) . 


i 


Confining  this  estimate  with  (4.7)  yields  that  Y  is  bounded  above  by  (4.9)  when 


1/4  ,1/2. 


|z(t/e2)|  <_  1,  but  with  the  change  that  (e/o)  is  replaced  by  (e  /a  >  in 
(4.9).  Thus,  on  integrating  the  bound,  we  get  that  the  second  term  of  (4.6)  is 
bounded  above  by 


(4.10)  C  (1  ♦  max(0,  log  |  z(  t/t  2)  | )  1 1  . 


Part  3.  We  turn  our  attention  to  the  first  term  of  (4.6)  and  show  that, 
by  an  "averaging",  it  can  effectively  be  replaced  by  its  expectation.  To  facil¬ 
itate  the  development,  we  define  the  following  terms. 


h  (v,t,t,p)  =  L2D'f  (v, t+t+p)D*g  (s (t+T+p)-G(v) +nE (t+t+p) )  • 


g  (s (t+p) -G(v) +n  (t+p)), 
a 


H  (v,t,T,p)  =  L2D'fvv<v,  t+e^t+e^p)  D-ga(s+(t5*-e*T+c*p)  -G(v)  +z(-~ +t+p)  /e) 


2  ,  2 


2  .  2 


•  g  (s(t+e2p)-G(v)+z(-~ +p)/e)  , 
«  e2 


(4.11) 


(4.12) 


AE,Nf(v,t)  H  EH£(v,t,T,0)dT  =  ^2  |Eh£ (v, t,T ,0) dr . 

0  e  0 
00  00 

fE,N(t)  s  [dp  (  dt[EE,Nh  (vE(t) ,t,t,p)  -  Eh  (v,t,t,p)  ] 

2  2  J  J  t  €  E  v=vE(t) 

0  0 
oo  00 

=  C2  fdp  f  dT[EE,NH  (vE (t) ,t,t ,p)  -  EH  (v,t,x,p)  ), 

J  J  £  e  v-vE(t) 


v=vE ( t) 


0  0 


where 


implies  that  v  is  replaced  by  v  (t)  after  taking  the  expectation 


v  v  ( t) 


We  must  show  that  f '  (*)  is  well  defined.  First  note  that  the  inner  integral 
t  N 

of  f^'  (t)  (with  p  =  0  and  a  chanqe  of  variable:;)  is  just  the  first  term  of 

i  ^ 

(4.6)  centered  about  its  expectation.  Win  form  of  f  '  (•)  is  cliosen  to  allow 

us  to  average  out  the  first  term  ot  (4.t>)  and  to  effectively  replace  it  by  its 
average  value  A^ ' Nf ( v*  ( t ) , t)  . 

By  the  method  used  to  bound  |f^'N(t)j,  we  con  get  that  the  inner  integral 
of  f^’^(t)  exists  for  each  p.  Recall  the  definition  -  miriiw:  e  aW,/'t  |  z  ( t/f. 2) 
and  write  (4.12)  as 


ao  cx.* 

dp  [  dx  E^N 

J 


W^  ^ 

B  +  e2  f  dp  dt  E^*NB  II  +  1  . 


w^  0 


First  we  show  that  II  is  well  defined.  By  (4.3)  and  the  definition  of  ,  the 
absolute  value  of  the  integrand  in  II  is  bounded  above  oy  C  exp  -a^p,  a^  >  0 . 
Also  (eH^ (v, t,t ,p) |  £C  exp  -a^x  for  some  a^  >  0.  By  (4.3)  and  on  the  set 
{p>w1),  and  for  C,  a^  (whost'  values  again  may  change  from  usage  tc>  usage) 


|e^'NH  (v,t,T,p) I  <  F^'N|eL,N  H  (V.t,T,p) 
t  £  —  t  2  ♦ 

t+t  0 


<  C  Eet'N  lexp  -a ^ 


t  ♦  I  { e  °  /2I  i(Vt')  i>l)l 


*  C  exp  -a^T  ♦  CP ( | z (— j+p )  | >e* T ^ 2 |  p^w^ ; z ( t/t 2) ) 

t 

<  C  exp  -a^T  +  Ce  aT'/2E<  |  z(^yt-p)  |  p>w^  iz(t/e2)  } 


<  C  exp  -a  i . 


..  -vtc-ar. 
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Chobychev's  inequality  is  used  to  qet  the  next-to-last  inequality.  Combining 

the  above  estimates  yields  that  the  integrand  in  II  is  bounded  by  (for  sane  a^  > 

C  v  «>)  C  exp  -a^(T+p).  Hence  TI  •-  0(t2). 

2  2 

The  term  I  is  also  0(c  )  but  not  uniformly  in  z(t/t  ).  Bound  the  inner 
integral  of  I  by 


00 


Oo 


’ 

. 

0 


dt 


e  ,N 
Et 


dt 


,K,N 


t+£ 


2 

p 


B| 


III . 


By  the  arguments  used  to  get  the  bound  on 


i ,N(t) | ,  we  get 


III  <  Ce2  E^'N[1  +  max(0,  logjz(^+p)  |)  ] 

€ 


<  C,2  Elt'N(l  +  log(  |  z(™*p)  |  +  1)3. 

t 


By  Jensen's  inequality  and  the  concavity  of  log('), 


III  <  Cc 2  [1  +  log(|z(^)  |  +  C)  J  . 

e 

2 

S ince  w1  <  C  max ( 0 ,  log | z ( t/c  ) | ) , 

(4.13)  |  f^  ,N(t)  |  <  Cc 2  [1  +  log(|z(^)  |  +  C)  J2. 

f 

Henceforth,  we  will  give  only  an  outline  of  the  details,  which  can  all 
be  filled  in  via  the  estimates  and  techniques  developed  above.  It  can  be  shown 
that  f 2 ' N ( • )  G  i^(AL,N)  and  that 
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(4.14)  At,Nf2*N(t)  -  negative  of  first  term  of  (4.G)  +  ' Nf (vL ( t) , t) 

t  (terms  whose  p-lim  equal  zero), 
i  K) 
a  HI 

The  term  whose  p-lim  =  0  is  just  (f"'  (t)) ‘v'(t),  where  f„'  is  the  gradient 

2 ,  v  2 ,  v 

C  N  1 

of  the  expression  for  f  '  with  respect  to  the  argument  v  (t).  The  components 
of  A  '  f  '  which  involve  f^^  ate  bounded  by  0(c) .  Loosely  speaking,  the 
remaining  component  is  of  the  form 


(4.15)  o(e)  +  e 


(Et,Nf  g  g  +  E^'Nf  g  g  -  E  f  g  g 
t  w  a,  v  ct  t  w  a  n,v  w  a,v  a 


-  E 


f  g  g  ]didp 
w  a  a,v 


where  we  omit  the  function  arguments.  By  a  method  similar  to  that  used  to  get 

2 

(4.13),  we  get  the  bound  (4.13)  on  (4.15)  but  with  (e/a)  replacing  r.  . 


Part  4 .  The  estimates  obtained  in  Parts  1-3  imply  that 


p-lim  |fE(t)-f(v  (t),t)|  =  0, 
e-*0 
a-*0 


p-lim 

e-*0 

a-K) 


|AE,Nfe(t)-f  (vG(t)  ,t)  -L/2A  (s(t)~G(v  (t))J  d  1  f  (ve 
1  t  o  v 


(t)  ,t) 


-  AG,Nf(vt(t) ,t) I  =  0. 


A  proof  very  similar  to  that  in  ([5],  Section  6,  part  2)  yield^  that  A^'Nf(v,t)  -*• 

D’  f^tv#  t) D(ln  2)/a  uniformly  in  v  for  each  t.  In  calculating  the  limit,  the 

*One  of  the  reasons  for  the  choice  cov[z (0) ,z (t) ]  -  o2exp  -at  is  to  allow  us  to  save 
work  by  using  this  result.  The  choice  allowed  an  explicit  evaluation  of  ths  diffusion 
term.  With  other  choices  the  diffusion  coefficient  would  be  left  in  an  "integral"  form. 
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G ( • )  ,  s(-)  play  no  role  and  the  limit  (e  -+■  0,  a  -*•  0)  is  the  same  as  for  the 

case  (a  =  0,  e  -*•  0)  .  If  the  b  (v)  terms  were  retained,  the  result  would  be  the 

N 

2 

same,  except  that  either  b  or  b  would  multiply  the  f  ,  f  or  f  .By  what 

N  N  v  w  vw 

has  just  been  said 

(4.16)  p-lim  |Ae,Nfe,N(t)  -  (|-  +  A) f (vL (t) ,t) |  =0, 
e-K)  3t 

a-K> 

where  A  is  the  infinitesimal  operator  of  v(-)  in  (1.2)  .  If  b^  were  retained, 

N 

the  A  in  (4.16)  would  be  replaced  by  some  A  which  would  equal  A  where  bN(-)  ~  1* 
i.e.  in  S  .  Thus,  by  Theorem  1,  if  {vE,N(-))  were  tight,  then  the  proof  would 
be  completed. 


Tightness .  Use  ([2],  Theorem  2). The  conditions  of  Theorem  2  [2]  hold  if  (4.17) 
holds  for  each  N  and  T  <  °°: 


(4.17)  lim  lim  p{sup|A  '  f  '  (t) |  >  K)  =  0, 

K-*»  e-K)  t£T 
a-K) 

lim  P{sup  | fE,N(t)+fE,N(t) |  >  6)  =  0,  each  6  >  0. 

e-K)  t<T  1 

a-K> 

But  (4.17)  follows  from  (4.10),  (4.13)  (and  a  similar  estimate  for  (4.15),  and 
the  fact  that  the  Gaussianness  and  stationarity  imply  that  (<.  r  .  *  ^  1  ’’ 


a 

lim  sup  e^|z(t/e  )j  =  0  w.p.  1.  Q.E.D. 

e-K)  t<T 
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